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Abstract 

A restrictive assumption in the work on testing for structural breaks in time series 
consists in the fact that the model is formulated such that the stochastic process under 
the null hypothesis of “no change-point” is stationary. This assumption is crucial to derive 
(asymptotic) critical values for the corresponding testing procedures using an elegant and 
powerful mathematical theory, but it might be not very realistic from a practical point of 
view. For example, if change point analysis for a particular parameter of the process (such 
as the variance) is performed, it is not necessary clear why other parameters (such as the 
mean or higher order moments) have to stay constant under the hypothesis that there is 
no change point in the parameter of interest. 

This paper develops change point analysis under less restrictive assumptions and deals with 
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the problem of detecting change points in the marginal variance and correlation structures 
of a non-stationary time series. A CUSUM approach is proposed, which is used to test the 
“classical” hypothesis of the form Hq '■ 6 i = 62 vs. Hi : 61 ^ 02 , where 0 i and O 2 denote 
second order parameters (such as the variance or the lag /c-correlation) of the process 
before and after a change point. The asymptotic distribution of the CUSUM test statistic 
is derived under the null hypothesis. This distribution depends in a complicated way 
on the dependency structure of the nonlinear non-stationary time series and a bootstrap 
approach is developed to generate critical values. The results are then extended to test the 
hypothesis of a non relevant ehange point, i.e. Hq : |0i — ^ 2 ! < <^5 which reflects the fact 
that inference should not be changed, if the difference between the parameters before and 
after the change-point is small. 

In contrast to previous work, our approach does neither require the mean to be constant 
nor - in the case of testing for lag fc-correlation - that the mean, variance and fourth order 
joint cumulants are constant under the null hypothesis. In particular, we allow that the 
variance has a change point at a different location than the auto-covariance. The results 
are illustrated by means of a simulation study, which shows that the new procedures have 
nice finite sample properties. The central England monthly temperature series are analyzed 
and significant change points in the variance and lag 1 -correlation are found in the winter 
monthly temperature at the late 19th century. 

AMS subject classification: 62M10, 62F05, 62G09 

Keywords and phrases: piecewise locally stationary process, change point analysis, relevant 

change points, second order structure, local linear estimation 


1 Introduction 


Change point analysis is a well studied subject in the statist i cal li terature. Since the semi¬ 
nal work on detecting structural breaks in the mean of iPagd (119541 ) a powerful methodology 
has been develop e d to detec t various types of chan ge points in time series [see for example 
Aue and Horvath ( 2013 ) and Ijandhvala et ah (j2013 ) for recent reviews of the literature]. Sev¬ 
eral authors have argued that in applications besides the mean the detection of changes in the 
variance or the correlation structure of a time series is of importance as well. Typical examples 
include the discrimination between stages of high and low asset volatility or the detection of 
changes in the T Jaram eters of an AR(p) model in order to obtain superior forecasting procedures. 


Wichern et ah 


(jl976f) studied the change point problem for the variance in a hrst order autore¬ 


gressive model. These authors pointed out that - even if log-return data exhibits a stationary 
behavior in the mean - the variability is often not constant and as a consequence any conclusions 
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base d on the assumption of ho moscedasticity could be misleading. lAbraham and Weil fjl984l) 
and iBaufavs and RassonI (119851) use d a Bayesian and an ML approach to find change points in 


the variance of an independent identically distributed sequence anc 

Lee and Pa 

( 

20011 derived 

corresponding results applicable to linear processes [see also 

Chen and Gupta 

(1997 

) wh 

o used 

the Schwarz information criterion]. Recentlv Galeano and Pena 

(2007 

) and 1 

\ue et ah 

(20091 


suggested n onparametric tests for s tructural breaks in t he variance matrix of a multivariate time 
series, while 


Davis et ah 


fl2nn6h and 


Breuss et ah 


(120141 ) proposed methods for detecting multiple 


breaks in piecewise stationary processes. 

This list of references is by no means complete but an important feature of the cited references 
and most of the literature on testing for structural breaks consists in the fact that the model 
is formulated such that the stochastic process under the null hypothesis of “no change-point” 
is stationary. This assumption is crucial to derive (asymptotic) critical values for the corre¬ 
sponding testing procedures using an elegant and powerful mathematical theory such as strong 


ler hand this assumpti on d rastically restricts 


Inclan and Tiao 

(1994 

) and 

Aue et ah 

(2009) 


approximations or invariance principles. On the ot 
the applicability of the methodology. For example, 
assume for the construction of a testing procedure for the hypothesis 

Hq : gI = O'j for alH, j = 1, ■ ■ ■ 5 versus Hi : af ^ for some i ^ j. 


( 1 . 1 ) 


of a constant variance of a time series that the mean of the sequence under consideration does 
not change in t i me (a s the variance under the null hypothesis). A similar assumption was made 
by 


Wied et ah 


(120121) in the context of testing for a constant correlation, where the authors 


suggested a CUSUM-type statistic for a change in the correlation of a stationary time series if 
at the same the means and variances do not change. However, from a practical point of view, 
assumptions of this type are very restrictive and there might be many situations where one is 
interested in a change of the variance (or the correlation) even if the mean (or the means and 
the variances) chang e gradually i n time. In this case the classical approach is not applicable 
any more. Recently, ZhoiJ (2013) investigated such a problem, in the context of testing for a 
constant mean, and demonstrated that the classical CUSUM approach yields to severe biased 
testing results if the assumption of (weak) stationarity (under the null hypothesis) is not satished. 
The situation gets even more complicated if one is interested in more sophisticated hypotheses 
such as precise hypotheses [see Berger and Delampadv ( 19871 )]. Here (in the simplest case) one 
assumes the existence of a change point k G {1,..., n} such that 


vi = al = ... = al ^ V2 = al^i = ... = al , 


( 1 . 2 ) 
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and is interested in hypotheses of the form 


Hq : A := \v 2 — Vi\ < 6 versus Hi : A := \v 2 — Vi\ > 6 


(1.3) 


for some pre-specihed constant 5 > 0. Throughout this paper we call hypotheses of the form 
lEB “classical” in order to distinguish these from the precise hypotheses of t he form fll.311. 
Altho ugh h ypothese s of th e form (11.311 have been discussed in other helds 


see 


Chow and Lin 


(119921 ) and iMcbridel (1199911] th e prob lem of testing precise hypotheses has only recently been 
considered by iDette and WiedI (12014 1 in the context of change point analysis. These authors 
point out that in many cases a modihcation of the statistical analysis might not be necessary if a 
change point has been identihed but the difference between the parameters before and after the 
change-point is rather small. In particular, inference might be robust under “small” changes of 
the parameters and changing decisions (such as trading strategies or modifying a manufacturing 
process) might be very expensive and should therefore only be performed if changes would have 
serious consequences. Testing hypothesi s of the f o rm (ll.3p to detect a structural break also avoids 
the consistency problem mentioned in BerksorJ ( 1938 1. that is: any test will d e tect n egligible 


changes in the parameter if the sample size is sufficiently large. 


Dette and Wie 


2 (1291 4I 1 


call the 


hypotheses of the form (II.3p hypotheses of a non relevant (null hypothesis) and relevant change 
point (alternative), and according to their argumentation only relevant change points should be 
detected, because one has to distinguish scientihc from statistical signihcance. 

Although the formulation of the testing problem in the form (11.31) is appealing, the construction 
of corresponding tests faces several mathematical challenges. In particular, one has to deal with 
the problem of non-stationarity (eve n under the null hypothesis of a non relevant change point). 
For example. 


Dette and 


WiedI ( 2014 1 developed a CUSUM-type test for the hypotheses in (II.3p . 


which is only applicable under the assumption that the time series before and after the change 
point is strictly stationary. From a practical point this assumption seems to be very strong and 
not very realistic. 

The present paper is devoted to the construction of change point tests for the second-order 


characteristics of a non-stationary time series, in particular changes in the varian ce anc 


/c-correlation. We consider piecewise locally stationary processes as discussed bv IZhonI (129131 ). 


the la g 


which are introduced in Section [2l Section |3] is devoted to the “classical” change point problem for 
the variance or lag fc-correlation of a piecewise locally stationary process. We propose a CUSUM 
approach based on nonparametric residuals and prove weak convergence of the corresponding 
CUSUM statistic. It turns out that the limiting distribution depends in a complicated way on 
the dependence structure of the piecewise locally stationary process, and for this reason a wild 
bootstrap approach is developed and its consistency proved. The methodology is very general 
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and applicable in many situations where the assumptions of classical tests are not satisfied. For 
example, in the problem of testing the “classical” hypothesis of a change in the lag fc-correlation 
we do neither assume that the mean, variance or higher order joint cumulants of the nonstation¬ 
ary sequence are constant nor that the change in the variance and the lag ^-correlation occur at 
the same location. Furthermore, we discover in this paper that the stochastic errors produced 
in the nonparametric estimation of the mean and variance function are asymptotically negligible 
in the second-order CUSUM statistic. The result is of particular interest and highly non-trivial 
because the order of the latter nonparametric errors are larger than the l/\/n convergence rate 
of the CUSUM test. 

Section 0] is devoted to the problem of testing the hypothesis of a non relevant change in the 
variance or lag /c-correlation. We use the CUSUM approach proposed in lDette and WiedI (120141) 
to obtain a test for the hypothesis (11.3p and its analogue in the case of lag ^-correlations. 
Asymptotic normality of a corresponding £ 2 -type statistic is established and a wild bootstrap 
method is developed, which addresses the particular structure of the hypotheses in relevant 
change point analysis. To our best knowledge resampling procedures for this type of change point 
analysis in non-stationary nonparametric problems have not been considered in the literature so 
far. The hnite sample properties of the new procedures are investigated by means of a simulation 
study in Section [5l In Section [6], we analyze the central England monthly temperature series 
and illustrate the usefulness of the proposed methodology in identifying second order change 
points in climate data. Finally, all proofs and technical details are deferred to an appendix and 
an online supplement, respectively. 


2 Piecewise locally stationary processes 

We start introducing some notations, which we frequently use throughout this paper. For a 
(real valued) random variable X and p > 1 we denote by ||W||p = (E|X|^)^/^ the Cp norm 
of X. The symbol —)■ means weak convergence of real valued random variables (convergence 
in distribution). For any interval X C M and nonnegative integer q dehne C^{X) as the set 
of q times continuously differentiable functions / : X —)■ M and C(X) = C°(X). Let {eijigz 
denote a sequence of independent identically distributed (i.i.d.) random variables and denote 
by X) = a(...,eo, ...,ej_i,ei) the sigma held generated by {ej\j < i}. We dehne the sigma held 
U-iXp U+1---X*)) where is an independent copy of {ejjjgz, and X* = X® 

for short. In the following discussion we will also make frequent use of the projection operator 
X,(-) = E(-|X,)-E(-|X,_i). 
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In this paper, we consider the model 


Yi = Mu) + e,, i = 


( 2 . 1 ) 


where (for the sake of simplicity) ti = i/n {i = 1,... ,n) and /i is a smooth function. Note that 
formally is a triangular array of random variables but we do not reflect this fact in our 

notation. Change point problems for this model have found considerable attention in the recent 
literature, where most of the work refers to problems of detecting a gradual change of the mean in 
the situation of zero mean and ind ependent iden tically distributed (i.i.d.) e rrors (even assumed 
to be Gaussian in some cases 
and 


see 


Mallik et ah 


Muller fjl992 ) for an earl y reference and 


Mallik et ah 


(1201 ih 

(120131 ) for more recent references]. Recently IVogt and Dettel (120151 ) proposed a 
;o det ect gradual changes in model (12. Ih using a different concept 


generalized CUSUM app roach 
of local stationarity [see IVogtl (120121 )] 


In the present paper we consider non-stationary processes of the form (12.11) and are interested in 
identifying abrupt changes in the second order properties such as the variance or the correlation 
at a given lag. More precisely we consider an error process in (12.ip . which is piecewise 

locally stationary (PLS) with r breaks for some r G N. Formally, we use the following definition 
for a PLS process and the concept of ‘^physical dependence measure for PLS ", which is given in 


Zhoul (I 2 OI 


Definition 2.1. 

(1) The sequence is called PLS with r break points if there exist constants 0 = Bq < bi < 

... < hr < br+i = 1 and nonlinear filters Go,Gi, ...,Gr, such that 


Ci Gjifi^Pi)^ bj ^ ti "P- bjjr\i 

where Pi = ..., ej_i, e*), and {ei}i^z is a sequence of i.i.d. random variables. 

(2) Assume that maxi<j<„ \\ei\\p < 00 for some p > 1. Then for fc > 0, define the kth physical 
dependence measure in Cp-norm as 

(5p(/c) = max sup \\Gi{t, Pk) - Gi{t, Pl)\\p, 

0 <i<r 


where 5p{k) =0 if k <t). 

For the asymptotic analysis presented later in this paper we list the following conditions: 

(Al) The process is PLS and piecewise stochastic Lipschitz continuous. This means 

that there exists a constant G > 0, such that for alH G {0,..., r} and all t,s E [bi, 6j+i] 
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the condition 


\\G,{t, J-o) - G,(s, J-o) \U{t -s)< C{t - s) 


holds, where l > 8 and C denotes a positive constant. In addition, E[ej] = 0 for all 1 < i < 
n, and we assume the existence of a strictly positive variance function : [0,1] —)■ 
such that af := = Var(ei) {i = 1,... ,n). 

(A2) The second derivative jl of the function /i in model (12.ip exists and is Lipschitz continuous 
on the interval [0,1]. 

(A3) maxo<i<r.suPig(fe, ||Gi(f, J'o)IL < oo for some l > 8. 

(A4) 6^{k) = O(x^) for some x ^ (0,1) and some l > 8. 

Remark 2.1. 

a) We emphasize that the bound of maxi<i<„ W^iWp in Definition 12.11 does not depend on n. 
This assumption is made in order to simplify the assumptions and the proofs in the subsequent 
discussion. It is also possible to develop corresponding results for an n-dependent bound with 
an additional complication in the technical arguments of the proofs and in the assumptions. 

b) Note that the process of squared errors is also PLS. Simple calculations show that 

{e'^}i=n satishes the assumptions (Al), (A3), (A4) with l > 4. 

3 Tests for changes in the second order structure 


Suppose that we observe data according to model fl2.ip . where the process is PLS 

and n is an unknown deterministic trend. We are interested in testing nonparametrically the 


“classical” hypothesis of a change point in the variance or 
differe nce to previous work on this subject [see for example 


(iS] 


he lag fc-correlation. T he important 


Inclan and Tiaol fjl994l) or 


Aue et al. 


is that in general the process is NOT assumed to be stationary under the null hypothesis 
of no change point. This means - for example - that the approach proposed here can be used to 
test the hypotheses fll.ip . where the mean is not constant. The price for this type of flexibility 
is that critical values of the asymptotic distribution of the CUSUM statistic are not directly 
available. For the solution of this problem we will develop a bootstrap CUSUM-type test for 
the “classical” hypotheses of a change point in the variance or lag fc-correlation, which is based 
on residuals from a local linear £t. For the dehnition of the local linear estimator we assume 
throughout this paper that the corresponding kernel function, say K, is symmetric with support 
[—1,1] satisfying f K(x)dx = 1, and define for 6 > 0 the function Ki,(-) = K{^). The moments of 
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K and K"^ are denoted by = J^x’‘K{x)dx and (j)i = J^x’'K'^{x)dx, respectively (/ = 0,1,...). 
We also assnme that K G 1,1]). 


3.1 Change point tests for the variance 

Onr first goal is to investigate the stability of the variances af = = Var(Fj) (* = 1,..., n) 

testing nonparametrically the “classical” hypotheses (ll.ip . For this purpose we consider the 
CUSUM statistic 


Tn = max 

l<i<n 


Si 


-S„ 


n 


(3.1) 


where Si = denotes the sum of squared nonparametric residuals ii = Yi — and 

is the local linear estimator of the function /i with bandwidth bn, that is 


= argmin^ (y* -bo- - t)YKb„{ti - t) 

boM 


(3.2) 


see 


Fan and Gijbelsl (119961) ] . Weak convergence of the statistic Tn/y/n follows under the addi¬ 


tional assumption 

(A5) The long run variance function 


OD 

cov(G-(f,J'fc),G-(t,J'o)) if t e {bi,bi+i], (3.3) 

k=—oo 


exists, Ki(0) := lim^o ^i(^) exists and inftg[o,i] niit) > 0. 

The following result provides the asymptotic distribution of T„. Its proof is complicated and 
therefore deferred to Section 17.1.11 in the Appendix. 

Theorem 3.1. If assumptions (Al)-(A5) are satisfied and nb^ 0, —)■ oo, then under the 

null hypothesis of no change in the variance we have 

-^fn A := sup \Ufit) - tufil)\, (3.4) 

A t6(0,l) 

where {t/i(t)}ig[o,i] is a zero mean Gaussian process with covariance function 

'y{t,s)= / Kl{r)dr. (3.5) 

Jo 
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It follows from the proof of Theorem 13.11 that the statistic has the same limit distribution 

as the statistic which is obtained if the nonparametric residuals Cj are replaced by the “true” 
errors e* from model (I2.1I1 . This observation is remarkable and highly non-trivial because the 
error from the nonparametric estimation is larger than 1 / ^/n. 

As a consequence of Theorem 13.11 we obtain - in principle - an asymptotic level a test for the 
hypothesis fll.ll) by rejecting Hq, whenever -^Tn > qi-a, where gi_Q, is the (1 — a)-quantile 
of the distribution of the random variable /Ci in fl3.4p . However, under non-stationarity (more 
precisely under the PLS assumption), the function defined in 03.31) and, as a consequence, 
the covariance structure of the Gaussian process {Ui(t) — fl7i(l)}tg[o,i] involves the complex 
dependency structure of the data generating process. Therefore it is very difficult to estimate 
the critical value qi-a of the asymptotic distribution of the CUSUM test statistic directly. As 
an alternative, a data-driven critical value will be derived in the following discussion using a 
wild boot strap metho d to mimic the distributional properties of the Gaussian process Ui{-). 
Following IZhoul (120131) we define for a fixed window size, say m, the quantity 


^Jm{n — m+1) 




m 




n 


SnjRj, i = 1, ...,n - m + 1, 


(3.6) 


where K is a sequence of i.i.d standard normal distributed random 

variables, which is independent of {ejjjgz. 

Theorem 3.2. Suppose that the conditions of Theorem AS . 1\ are satisfied. In addition, assume 
that m —)■ oo, m/n —)■ 0, mhf\o^ n —)■ 0 and —)■ 0. Then 

Tlbn. 


Mn = 


max 

m+l<2<n—m +1 




-<hn- 


n—m+l,m 


V 


/Cl 


n — m + l 

conditional on where the random variable /Ci is defined in (13.41) . 

Theorem 13.21 provides an asymptotic level a test for the hypothesis of a constant variance in 


model (12.ip . where the critical values are obtained by resampling. The details are summarized 
in the following algorithm. Some illustrations of this method are given in Section [5] and [6l 

Algorithm 3.1. 

[1] Galculate the statistic T„ defined in (13.11) . 

[2] Generate B conditionally i.i.d copies = 1; • • •+) of the random variables 

defined in (13.61) and calculate 


Mr = max 


- 

2,m 


<!>. 


(r) 


_ I -1 ^n—m+l,m 

n — m + l 
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[3] Let M(i) < M( 2 ) < ... < M(s) denote the order statistics of Mi,..., Mb- The null hypothesis 
of constant variance is rejected at level a, whenever 


Tn/Vn > MyB{l-a)\ ■ 

The p-value of this test is given by 1 — where B* = max{r : M(^) < Tn/\/n}. 


(3.7) 


Re mark 3 . 1. it follows by similar arguments as given in the proof of Theorem 2, Proposition 
3 of IZhoul (120131) . and Lemma 18.11 and Lemma 18.21 in the appendix of this paper that the 


bootstrap test (13.71) is consistent. In fact the test is able to detect local alternatives of the form 


a' 


'(•) = cr^ + n ^'^^/(•)) where /(■) is a nonconstant piecewise Lipschitz continuous function. 


3.2 Changes in the correlation 

In this section we consider the problem of testing whether there are changes in the correlation 
Pi^k '■= corr(Yi,Yi^k) for some pre-specihed lag k. Namely, we are interested in testing the 
hypotheses 


Hq : Pi,k = Pj,k for alH, j = 1,..., n versus Hi : pi^k 7 ^ Pj,k for some i 7 ^ j. (3.8) 


A test for the classical hyp othesis Hn : P±_= P 2 in stationary processes can be derived by 
similar arguments as given in IWied et ahl (120121 1 under the additional assumption that the mean 


and variance are not changing. However, statistical inference regarding changes the correlation 
structure in a general local stationary framework (including non constant mean or variance) 
requires estimates of the covariances and variances. For this purpose we consider two different 
estimators. First, let (J^(L) = b„(^i) denote the local linear estimates of af = cr^(fi) = Var(y)), 

which is dehned as 


= argmin > (e^ - ho- hi{ti - - f). 


bo,hi 


i=\ 


(3.9) 


where Ci = Yi — denote the residuals obtained from a £t of the local linear estimate with 

bandwidth This estimate is appropriate if there is no structural break in the variance. 

If this situation cannot be excluded, a more refined estimate for the function is required. 
To be precise, we also allow the variance to have a structural break at a point, say which 
does not necessarily coincide with the location of the change point in the lag fc-correlation. We 
assume that is Lipschitz continuous on the intervals (0,^.,;) and {ty, 1) and that there exists a 
constant C > 0, such that iy G [(C, 1 — C]. We dehne an estimator, say t*, of the change point iy 
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in the variance by 


t* = argmax \M{i)\, 

[nCJ <i<n-[nCJ+l 


where 


M{i) 


1 

L 


E 


j=i-L+l 






(3.10) 


(3.11) 


and L G N is a regularization parameter, which increases with n. N ote that the ma ximum in 
(13.1011 is n ot taken over the full range 1 < i < n as recommended in Andrew^ ( 1993 ) [see also 


Qu 


(i2008l )]. Finally, the second estimator of the variance function is defined by 


= S?(s)/(s < C) + 52 (s)/(s > Q , 


(3.12) 


where df and are the local linear estimator of the variance function from the samples 
Yi,..., Yint*] and Y\nti\+i-i ■ ■ ■ ,Yn, respectively. The following result shows that is a consistent 
estimate of t^. A proof can be found in Section ITTI 

Lemma 3.1. Assume that nb^ —>■ 0, nb^ —)■ cxo and that Assumptions (Al) - (A4) are satisfied 
with L > S. Suppose that the variance function is twice differentiable on the intervals (0,f^) and 
{t^,l), such that the second derivative is Lipschitz continuous (here ty is the location of the 
change point of the variance, which is defined as iy = 1 if there exists no jump). Let L = [n“J 
for A/i < a < 1/2, then the estimator defined in (13.10^ satisfies — iy = if 

< 1 . 

Remark 3.2. Observe that the lower bound for the parameter a in Lemma [3.11 converges to 0 
as t —>■ oo. Consequently, if 0 < < 1, the rate of convergence of the estimator is arbitrarily 

close to the optimal rate n~^ if Assumptions (Al) and (A4) hold for any t > 0. 

The lag /c-correlation at the point ti is estimated by a local average of the quantities of the form 


Wf = 




a 


m' 


(3.13) 


where d^ is either the estimate dehned in (13.9p (if a change point in the variance can be 

excluded) or the estimate a/f defined in (I3.12p . For convenience, we set e* = 0, whenever i > n, 
in the following discussion. The corresponding partial sum is denoted by 
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we consider the CUSUM statistic 


max 

l<i<n 



--s„ 

n 


w 


(3.14) 


In order to specify the necessary assnmptions for the asymptotic theory, recall the dehnition of 
the random variable in fl3.13p and dehne an nnobservable analogne by 


wt 


^ (ti) ^ 


(3.15) 


It is easy to see that is Ti+k measnrable and that the process is PLS. Dehne q 

as the nnmber of break points, 0 = Cq < Ci < ... < = 1 as the corresponding locations of 

the breaks and Hq, Hi, ... ,Hq as the corresponding nonlinear hlters, that is = Hj{ti, Hi+k) 
if Cj < ti < Cj+i. For the asymptotic analysis of the CUSUM-test we reqnire the following 
additional assnmption: 

(A6) The long rnn variance fnnction 


OO 

^lit) = COV {Hi{t,J'k),Hi{t, Ho)) if t e (ci,Ci+i], 

k=—oo 


exists, the limit ^2(0) = liniqo k^it) exists and inftg[o,i] > 0 - 

Theorem 3.3. Assume that bn —>■ 0, Cn/bn 0, —)■ 00 , 0, —)■ 0, nb'^Cn'^ 

OO and suppose that Assumptions (Al) - (A 4 ) and (A6) are satisfied with l >8. Assume that 
in fl3.13p is either the estimate defined in fl3.9p . if there is no change point, or the estimate 
af* defined in fl3.12p if there exists one change point, say i^, in the variance function. In the 
latter case let be strictly positive and twice differentiable on the intervals (0,f.y) and {in,l), 
such that the second derivative is Lipschitz continuous. Then under the null hypothesis of no 
change point in the lag k-correlation we have 



snp |f/ 2 (f) - fD 2 (l)|, 

i6(0,l) 


where {U 2 {t) | f G [0,1]} zs a centered Gaussian process with covariance kernel 


l{t,s) 



K2{r)dr. 


(3.16) 


In order to develop a consistent bootstrap test recall the notation fl3.4p . dehne 5]^ = ^ 
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and as in fl3.6p where Sn and Sj^m are replaced by and respectively. 


Theorem 3.4. Assume that the conditions of Theorem \S.A are satisfied. If m ^ oo, m/n —)■ 0, 
+ (7= + bl + log^ ^ 0, then 


MY = max 

m+l<i<n—m+l 


$ 


W 


-<h 


w 


V 


ICo 


^ ^ I 1 m+l.m 

n — m + 1 

conditional on Tn, where the random variable /C2 is defined in Theorem \3.3[ 


(3.17) 


Theorem 13.31 and 13.41 yield the consistency of the following bootstrap test for the hypothesis of 
a constant correlation with asymptotically correct type I error rate. The nnll hypothesis fl3.8p is 
rejected whenever 

(3.18) 

Here is the (1 — aj-quantile of bootstrap sample of the distribntion of the statistic 

defined in fl3.17p . which is generated in the same way as described in Algorithm 13.11 

Remark 3.3. 

(1) Assnme that nnder the alternative hypothesis the variance has at most one strnctnral break 

at the point t^ and that the second derivative of the variance fnnction is Lipschitz continuous 
on the intervals and {ty, 1). Then it can be shown by similar arguments as indicated in 

Remark 13.11 that the bootstrap test (I3.18p is able to detect local alternatives converging to the 
null hypothesis at a rate 

(2) In principle one could always work with the estimator defined in fl3.12p . However, a 

numerical study indicates that in cases where there is in fact no change point in the variance the 
estimator dehned in (I3.9p has a better hnite sample performance. Therefore we strictly 

recommend its use, if a structural change in the variance can be excluded. 

Remark 3.4. Although the method described so far refers to the problem of detecting a 
change point in a particular lag fc-correlation, it can easily be extended to the problem of testing 
for a change in one (or more) correlations simultaneously. We illustrate extensions of this type 
exemplarily for the problem of testing the hypotheses 

Ho ■ Pi,k = Pj,k = Pk for alH, j = 1,..., n, fc = 1,..., g (3.19) 

Hi : there exists 1 < A; < g and i j such that pi^k 7^ Pj,k ■ (3.20) 

For this purpose recall the notations fl3.15p and fl3.13p and define the vectors Wj = {Wfi ..., 
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and Wj = iWl ^..., . For some norm || • || on we consider the statistic 


j 1 

t"= max VWi--VWi 


i=l 


2=1 


Observe that {Wj}jgi^ is a PLS process, with nonlinear filter fnnction, say W, and (nnknown) 
£ break points Q = do < di <...,< < d^+i = 1. Assnme that the conditions of Theorem 13.31 

are satisfied, where assnmption (A6) is now replaced by the condition 

(A6*) The long run variance function 

CO 

~4{t) = Y. (W,(t, j-fc), w,(f, j-o)) e if f e (di,d.+i], 

k=—oo 

exists. Let \{t) = denote the smallest eigenvalue of the matric k^(f), then 

lim^o A(f) exists and inf^gjo^j X(t) > 0. 

Under these assumptions it can be shown by similar arguments as given in the proof of Theorem 
13.31 that 

1 t: A sup ||U2(t)-tU2(l)|l, 

te(o,i) 

where {U2(f)|t G [0,1]} is a g-dimensional centered Gaussian process covariance kernel 'y{t, s) = 

jmin(t,s) ^ ^qxq_ 

A similar result can also be derived for the new bootstrap procedure. To be precise define the 
vectors 

i = q + l,...,n — m — q + 1, 

(3.21) 


< 1 > — 

^ 2. m 


m(n — m — 2q + 1) 




j=q+l 


m 


■’j,m 


n 




where S,,^ W, and {Ri}i^z is a sequence of i.i.d standard normal 

distributed random variables, which is independent of {ejjjgz- If the conditions of Theorem 13.41 
hold [where assumption (A6) is again replaced by (A6*)], then we have (conditional on 


= 


max 


* _ 

^ 1 m 


n 


m 


2q + l 


^ n. —m— 


m—g+l,m 


V 


sup 

46 ( 0 , 1 ) 


|U2(«)-(Uj(l)||. 


These results show that the bootstrap test fl3.18p can easily be extended to discriminate between 
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the hypotheses (13.191) and (13.201) . 


4 Relevant changes of second order characteristics 

After a change point has been detected and localized a modihcation of the statistical analysis is 
necessary, which a ddresses the diffe r ent fe atnres of the data generating process before and after 
the change point. iDette and WiedI (120141) pointed ont that in many cases snch a modihcation 
might not be necessary if the difference between the parameters before and after the change 
point is rather small. On the one hand, inference might be robnst with respect to small changes 
of the variance or correlation structnre. On the other hand, changing decisions (snch as trading 


strategies or modifying a mannfactnring process) might be ver y expensive and o n 


if changes would have serious consequences. For these reasons iDette and WiedI (120141 ) proposed 


V be performed 


to investigate the hypothesis (II.3p of a non relevant change point, which will be discussed in 
this section for the variance (Section 14.ip and lag fc-correlation (Section 14.2p in a general non¬ 
stationary context (more precisely under the assumption of PLS). 


4.1 Relevant changes in the variance 

First we investigate the problem of testing for a non relevant change in the variance of a time 
series. Recall the definition of model (12.ip and that af = u‘^{ti) = Yax{Yi) is the variance of 
the response at the point ti. Throughout this section we assume the existence of some fixed but 
unknown point t^ G [0,1] such that the variance function in Assumption (Al) is constant on 
the intervals (0,t^) and {iy, 1) and that the variances af satisfy (II.2p with k = For some 


a non relevant c 


Dette and WiedI ( 20141 ) 


range 


pre-specified 5 > 0, we are interested in testing the hypothesis (II.3p o 
in the variance. Problems of this type have recently been discussed in 
under the assumption that the process before and after the change point is stationary and that 
additionally the mean of the process is constant (even if there is a change of small order in the 
variance). From a practical point of view, assumptions of this type are not very satisfactory and 
in this section we will propose a procedure for detecting relevant changes which does not require 
such strong assumptions. 

It turns out that a test for the hypothesis (11.31) needs an estimator of the change point, and we 
could use the estimator defined in (I3.10p for this purpose. This estimator requires the choice 
of the regularization parameter L. However, in the present context such a complicated estimator 
is in fact not necessary, because the variance before and after the change point is assumed to be 
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constant. Therefore we introduce the alternative estimator 


t 


n 


argmax (Sm-—Sn) , 

l<m<n ^ 12/ 


(4.1) 


where Sm = denotes the mth partial sum of the squared residuals e-i = Yi — 

obtained from a local linear £t. Estimators, maximizing t he CUSUM-type statis tics have been 
widely studied in the situation of stationary processes [see Ijandhvala et ^ ( 2013 )]. and it turns 
out that the statistic in has better hnite sample properties than the estimator dehned in 
fld.inp . if the variance function before and after the break point ty is in fact constant (which is 
our basic assumption throughout this section). 

Let A = V 2 — Vi denote the “true” difference before and after the change point. Our hrst result 
establishes the asymptotic properties of the estimator tn under the PLS assumption and is proved 

in Section [721 


Lemma 4.1. Assume that bn 0, oo, nh^ —)• 0 and that Assumptions (Al) - (A5) are 

satisfied. Suppose that the variance function is strictly positive and constant on the intervals 
(0,f.y) and (ty, 1) (here iy is the location of the change point, which is defined as iy = 1 if there 
is no jump). The estimate tn defined in (14.11) has the following properties: 

(i) If A = 0, then tn converges weakly to a [0, l]-valued random variable. 

(a) If A 0, then \in — iy\ = Op{n~°‘) for some a > 1/2. 


We now use the statistic (14.11) to dehne estimates of the variance before and after the change 
point, that is 


^n,l — 


\nin\ 




V 


An,2 — 


n - [ntn\ 




-'V 


j=\nin\+l 


and denote by = A„ 2 ~ A„ 1 an estimator of the difference A. Using similar arguments as 
given in the proof of Lemma [4.3! in Section 1721 it follows that A„ — A = Op(l^^). In order to 
construct a test for the hypothesis (II.3p we now consider the statistic 


f 



(4.2) 


where the process Un is the CUSUM process dehned by 


Un{s) = 


1 

n 



n 



j=l 
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The following result establishes the asymptotic properties of the statistic Tn- The proof is omitted 
because it follows by similar but easier arguments as given in the proof of Theorem 14.31 below, 
where we investigate the problem of testing for non relevant changes in the lag ^-correlation. 

Theorem 4.1. Suppose that the conditions of Theorem \3. 1\ hold. 

(i) // A 7^ 0, then 

- A2 ) a Zi(A) := ^ /Vi(s) - sf/i(l)) A - S A Q\A\ds, (4.3) 

where {t/i(s)}sg[o,i] denotes the Gaussian process defined in Theorem \3 . 1\ 

(a) If A = 0, then T„ = Op{l/n). 

It follows from Theorem 14.11 that an asymptotic level a test for the hypothesis fll.3p of a non 
relevant change in the variance is obtained by rejecting the null hypothesis, whenever 

tn>6^ + (4.4) 

Vn 

where Vi^a denotes the (1—a)-quantile of the distribution of the random variable ZfiS) dehned in 
(I4.3p . We note that this distribution is a centered normal distribution with a variance depending 
on the data generating process in a complicated way, in particular on the long run variance 
defined in fl3.3p . In order to circumvent this problem we will develop a resampling procedure to 
obtain critical values, where we have to address the particular structure of the hypothesis fll.3l) 
of a non relevant change point in the variance. To be precise, define 

Vj = e]- AJ{i > [nin\). (4.5) 

Let denote a sequence of i.i.d. standard normal distributed random variables, which 


is independent of define = X]r=7 ^ 

S'fi = ^ consider the 

random 

variables 



... n—m+1 



\/m{n — m+l) 

(4.6) 


The following result provides a bootstrap approximation for the distribution of the random 
variable Zfil). The proof follows by similar but easier arguments as given in Section [TJ where a 
corresponding statement is proved for change point tests in the correlation structure. 
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Theorem 4.2. Assume that the conditions of Theorem \4-I] are satisfied. In addition, assume 
that m —)■ oo, m/n —)■ 0 , mbfilog^ n —)■ 0 and —)■ 0 , then 


nb' 


M„ = 


6 


ntl{l-tnfi 


j : 


_^_ 

n-m + 1 


n—m+l,7n 


m+l<2<n—m+1 

conditionally on IFn, where Zi{l) denotes the random variable defined in 


(itn i ^ ~ 
\ n n 


2 i(i) 


We summarize the bootstrap test for the hypothesis fll.31) of a non relevant change in the variance 
structure in the following algorithm. 


Algorithm 4.1. 

[1] Calculate the statistic T„ dehned in 04.21) . 

[2] Generate B conditionally i.i.d copies = I 5 2, of the sequence 

dehned in 04.6p and calculate 


M^ = -^ -^ , 

ntl{l-tnfi 


E 


m+l<i<n—m+l 


<|)W^ _ 

2,m 


n 


m + 1 


71—m+l,m 


itn 

n 


- Atj 

n 


[3] Let < ... < denote the order statistics of Mfi,... ,M^. Reject the null 

hypothesis 01 .3p of a non relevant change in the variance at level a if 


f„>S^ + (4.7) 

The p-value of this test is given by 1 — where B* = max{r : H —< T„}. 

Remark 4.1. It is of interest to investigate the power of the tests 04.4p and 04.7p . For this 
purpose note that it follows for A 7 ^ 0 from 04.3p 


finiS,A)=¥{Tn>6^ + 


s-2 , '^l—ad 


n 


^fn-A^ ^ 

= PI \/n -r-TT- > \/n -rrr-^-h 


|A| 


|A| |A| 


1 - , + 


|A| 


|A| 


(4.8) 


where 4/ is the distribution function of the random variable Zfil) (in fact a centered normal 
distribution). Therefore, under the alternative of a relevant change A^ > 5^, we have A) —)■ 
1 as n ^ cx), which provides the consistency of the test 04.4p . On the other hand under the null 
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hypothesis 0 < < <5^ we have 


lim Pn(,S,A) 

n—¥oo 


0 if 0 < A2 < ^2 

a if A2 = ^2 


If A = 0, then = Op{l/n) and lim^^oo/9n(^; A) = 0, which means that the test fl4.4p has in 
fact asymptotic level a. 

We can also use fl4.8ll to investigate the power as a function of the parameter 6 in the hypothesis 
fll.dp . For example, we can see that for sufficiently large n the power l3n{^,A) is approximately 
1 if 5 —)■ 0, and A) is approximately 0 if 5 —)■ cxd. Moreover, it is easy to see that all 
statements mentioned in this remark hold also for the bootstrap test defined by fl4.7p . 


4.2 Relevant changes in correlation 

Consider model fl2.ll) . denote by pi^k = corr(yj, "Fj+fc) the correlation at lag k and suppose that 
for some unknown t G (0,1) 

Pi Pl,k ••• Plnt\,ki P2 P[ntJ+l,fc ••• Pn—k,k- (^•^) 

In this section we are interested in the problem of testing the hypothesis of a non relevant change 
in the correlation at lag fc, that is 


Ho ■. \pi - P 2 \ < 5 versus ifi : |pi - P 2 I > ^ 


(4.10) 


for some pre-specified 5 > 0. iDette and WiedI (120141) provided a method for testing the hy¬ 


pothesis fld.lOj) under this and the additional assumption that the process before and after the 
change point exhibits a stationary behaviour. However, in general local stationary framework 
the construction of a test is more difficult and will be explained in the following paragraphs. 
We denote by A = p 2 ~ Pi fhe (unknown) difference before and after the change point and 
assume throughout this section that under the null hypothesis of a non relevant change in the 
correlation the variance function cr2 has either no jumps or has a jump at a point, say ty, which 
does not necessarily coincide with the change point t in the correlation structure. In order to 
estimate the correlation consistently before and after the change point we recall the definition 
of the variance estimator fl3.12l) . which addresses the problem that the variance function before 
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and after the change point is not constant. We dehne the CUSUM process 



(4.11) 


where e-i = Yi — denotes the nonparametric residnals from the local linear £t and we use 

the convention that e* = 0 for i > n. The estimator for the change point of the correlation 
structure is hnally defined by 

in = argmax . (4.12) 

l<m<n 

Note that the statistic in depends on the estimator t* for the change point in the variance, which 
is defined in fl3.10p . The first result of this section establishes consistency of this estimate. 

Lemma 4.2. Suppose that the conditions of Lemma IJ.il are satisfied, and that the conditions 
for the bandwidths bn and Cn of Theorem \3.fA hold. The estimate tn of the change point in the 
correlation structure defined by fl4.12p satisfies 


(4.13) 

(4.14) 

for some a > 1/2, where T^ax is a [0, l]-valued random variable. 


in Tmax, if |A| = 0, 
\in -t\ = Op(n"“), if |A| > 0, 


The test for the hypothesis of a non relevant change will be based on the statistic 

Q /•! 


t: = 


«?.(! - Jo 


(£;W(s))^ds, 


(4.15) 


where the the proeess Un is defined in 114.1 1 1I . The following theorem shows that is a 

consistent estimator of and also provides its asymptotic distribution. 


Theorem 4.3. Assume that the conditions for the bandwidths bn and Cn of Theorem \3.3[ hold, 
and that Assumptions (Al) - (Af) are satisfied with l > 16. Suppose further that the variance 
function is strictly positive, twice differentiable on the intervals (0,f^) and (ty,l), such that the 
second derivative is Lipschitz continuous (here t^ is the location of the change point, which is 
defined as iy = 1 if there is no jump). If there exists a break, then we assume additionally that 
iy G [C; 1 “ C] for some constant C > 0. 
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i) If A ^ 0, then 

- A^) A Z2(A) : = 


6 


Jo 


[U2{s) - sU2{l)][st - s A t]|A|cis, 


(4.16) 


where the Gaussian process {[/ 2 (s)}se[o,i] defined in Theorem AS.tA 
a) If A = 0, then Tf = Op{l/n). 


A careful inspection of the proof of Theorem 14.31 shows that the statement 04.161) remains correct 
for any estimator of the change point in the correlation structure, which satishes 04.131) and 
04.14p . Moreover, Theorem 14.31 yields an asymptotic level a test for the hypothesis 04.101) of a 
non relevant change in the correlation structure by rejecting Ho, whenever Tf> 5"^ + 

Vi-a denotes the (1 — a)-quantile of the distribution of the random variable Z 2 {d) dehned in 
04.16p . We note again that this distribution is a centered normal distribution with a variance 
depending on the data generating process in a complicated way. In order to provide a consistent 
bootstrap approximation of the distribution of the random variable ^ 2 ( 1 ), recall the dehnition 
of the estimator tn of the change point in the correlation structure in fl4.12|) . We consider the 


statistics A„ 1 = 

[ntn I ^.7 = 1 




, Ayi^2 — 


■y-^ n—k 




-[nLi ^j=[ntn\+l 


and dehne 


An An^2 An^l 


(4.17) 


as an estimator of the difference A = p 2 — pi- The next lemma provides consistency of A„ and 
is proved in Section 17.21 


Lemma 4.3. Suppose that the conditions of Theorem |.^.3| and Assumption (A6) are satisfied, 
then An-A = 0,C^). 


Dehne 


~ - LAJ), 


(4.18) 


where the variance estimator is given by fl3.12p . and let {Rj}j^z be a sequence of i.i.d. standard 
normal distributed random variables, which is independent of {IFi}i^z- We introduce the partial 
sums Sf^ = =YJl-=i-^r and dehne 


n—m+1 


<|)^ = 
2,m 


^Jm{n — m + 1 ) 




(4.19) 
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then the following result is proved in Section 17.31 


Theorem 4.4. Suppose the conditions of Theorem |^.g| hold and that m oo, m/n —)■ 0, 
+ ( 7 = + bl + logn ^ 0 , then 


m: = 


6 


ntl{l-tnf 




. 

i.m 


m+l<i<n—m+l 


% ^ A \ f ^ 


V 


£ 2 ( 1 ) (4.20) 


conditional on Tn, where the random variable ^ 2 ( 1 ) is defined in Theorem f.3 


Theorem 14.41 provides a consistent asymptotic level a bootstrap test for the hypothesis of a non 
relevant change in the correlation structure. The hypothesis fld.lOp is rejected, whenever 




(4.21) 


Here is the (1 — aj-quantile of bootstrap sample of the distribution of the statistic 

Mf dehned in fl4.20p , which is generated in the same way as described in Algorithm 14.11 The 
consistency and the properties of the power function of the bootstrap test follow by similar 
arguments as given in of Remark l4.1l for the test of non relevant change in variance. The details 
are omitted for the sake of brevity. 

Remark 4.2. The proposed method can easily be generalized to address the problem of testing 
for a relevant change in several correlations simultaneously. Exemplarily we illustrate such a 
generalization in the situation, where one is interested in detecting a relevant change in any of 
the lag 1- to lag g-correlations. Consider model (12.ip and suppose that there exist time points 
tk G ( 0 , 1 ) such that 


Pl(^) Pl,k P[ntif.\,kj p2{fif P\nt)f.\ + l,k Pn—k,k 

(1 < fc < g). We are interested in testing the hypotheses 


Hq : \pi{k) - P 2 {k)\ < 4 for all /c = 1 ,... ,g 

Hi : there exists a lag A; G { 1 ,..., g} such that |pi(A:) — P 2 {k)\ > 4 , 


where ^i,..., 4 are given thresholds. For each lag fc, let tn,k denote the estimator for tk defined 
in (I4.12P and define 



(4.22) 
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where is given by fl4.1ip . Recall the dehnition of the g-dimensional process U 2 (t) = 
( 1 / 2 , 1 , ■ ■ ■ 5 in Remark 13.41 and snppose that the conditions of Theorem 14.31 and (A6*) 

hold. Then it can be shown 


max |v^(T; 

l<i<q 


A' 


V 


Z* = max 

l<i<q 


u\i-uy 


[h' 2 ,i(s) - sU 2 ,i{l)][sti - s A ti\\Ai\ds 


where = pi{k) — P 2 {k) (1 < A; < q). Similarly, bootstrap methodology can be developed 
considering consistent estimates, say Afc,„, of A^. Let 





> \nik\), 


(4.23) 


and consider the vector Aj = ..., Fnrther, define ^ A-r, = 

K and 


n—m+l 


i.m 




- V 

+ 1 ) 


j,m 


^n—Q 

n ^ 


Ri 


(4.24) 


{i = q + 1, ...,n — m — q + 1), where {Rj}j^z is a seqnence of i.i.d. standard normal distribnted 
random variables, which is independent of Let be the kth entry of (1 < A: < g) 

and dehne 






E 


m-hl<i<n—m+l 


- 

i.m 




n 


-$ 

I 1 I.m 

m + 1 




I ^ I ^ 

tn,k A tfi k 
n n 


(4.25) 


T) 

Then (conditional on Rn) niaxi<fc<g |AfcM^^| —> Z*, which provides the consistency of a corre¬ 
sponding bootstrap test. 


5 Finite sample properties 

In this section we investigate the finite sample properties of the proposed tests by means of 
a simnlation stndy. In all examples considered we used the function p(t) = 8 (—(A — 0.5)^ -|- 
0.25) as mean function and a sequence of independent identically normal distributed random 
variables {ej}j^z in the definition of the errors e* = Gj{i/n,Ri) in model fl2.ip . where Ri = 
a{... ,eo, ■ ■ ■ ,Si). The dependency structures differ by different choices for the nonlinear hlter 
Gj. The sample size is n = 500 and all results are based on 2000 simulation runs. In each run. 
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Figure 1: 


Typical sample paths of the processes corresponding to model I-VI. 


the critical values are generated hy B = 2000 bootstrap replications. 


5.1 Change point tests for the variance 


In this section we investigate the finite sample properties of the bootstrap tests for the “classi¬ 
cal” hypothesis fll.ip of a constant variance and for the null hypothesis fll.Op of a non relevant 
change in the variance. It turned out that for change point analysis of the variance the Mini¬ 
mal Volatility (MV) method for the selection of the bandwidth provided slightly better results 
than t he commonly used cross validation. The MV-method has been advocated in iPolitis et ah 


(119991) . To be precise consider a sufficiently wide interval (a, 6) C (0,1) and define the points 
di := a+{i — l){h — a)/l G (a, 6), i = 1,2,...,/ as potential bandwidths. For each di, we calculate 
the statistic Tn{i), which is defined as the test statistic T„ in (13.ip with bandwidth = di. We 
then calculate for i = 4,...,/ — 3 the standard errors SD{i) := sd{Tn{j),i — 3<j<i-\-3) of 
different subsamples of these statistics. Finally, the bandwidth bn in the interval (a, b) is chosen 
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as bn = di*, where i* = axgmin^^ ■ SD{i). 

In the following we discuss three models for the hlters of the innovations in model fl2.ip . 

(I) G{t,J^i) = H{t,Ti)/A, where + e* for t < 0.5, and = 

—0.5H{t, + Si for t > 0.5. 

(II) G{t,J^i) = iJ(f, J'i)A/r^^a2^/4, where and a{t) = l/A + t/2. 

(III) G(f, J)) = for t < 0.5, and ^(f, J)) = i/(f, J)) v/2(l - 62(t))/8 for 

t > 0.5. Here ^ and H(t,J^i) = 

t > 0.5, a{t) = 1/4 + f/2 and b{t) = 0.5 — {t — 0.5)^. 

Model (I) is a piecewise stationary process. The correlation has a structural break at the point 
t = 0.5. Model (II) is a locally stationary process. The MA coefficient is smoothly time-varying. 
By our construction, the variance of model (I) and model (II) remains constant, which means 
that both models correspond to the “classical” null hypothesis in fll.ip . Model (III) is a piecewise 
locally stationary process. The correlation of model (III) has a jump at the point t = 0.5. The 
MA coefficients before and after the point t = 0.5 are smoothly varying. The variance of model 
(III) also has a jump at f = 0.5 and remains constant before and after the jump, respectively. 
This model corresponds to the null hypothesis in fll.3l) of a non relevant change point in the 
variance. Typical trajectories of the processes corresponding to model (I) - (III) are displayed 
in the upper part of Figure [H 

For model (I) and (II) we investigate approximation of the nominal level of the bootstrap test 
fl3.7p for the classical hypothesis fll.ip of a change in the variance over the interval [0,1]. The 
corresponding rejection probabilities are displayed in the left and middle column of Tabled! We 
display the simulated type I error using different bandwidths in the interval (0.025, 0.3) and the 
bandwidth calculated by the MV-method (last line). We observe that the results are rather 
stable with respect to the choice of bn- Only if < 0.1 the level is overestimated. In particular 
the bandwidth calculated by the MV-method yields good results for both models. 

For model (HI), we are interested in testing the hypothesis fll.3l) of non relevant changes in 
variance, where the threshold is given by 5 = 1/64. The corresponding rejection probabilities 
are shown in the right column of Tabled! for the case that A = 6 = 1/64. Note that this choice 
corresponds to the boundary of the null hypothesis, and according to Remark 14.11 the nominal 
level is smaller in the interior of the null hypothesis, i.e. A < 6 (these results are not displayed for 
the sake of brevity). Interestingly - compared to the problem of testing the “classical” hypothesis 
fll.ip - the method is more sensitive with respect to the choice of the bandwidth. However, the 
MV-method yields a rather accurate approximation of the nominal level. 
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Table 1: Simulated Type I error of the tests fl3.7l) and fl4.7p for a change in the variance for 
various bandwidths and the bandwidth calculated by the MV-method (last line). Left and middle 
column: test for the hypothesis fll.ip (model (I) and (Id))- Right column: test for the hypothesis 
fll.Sp (model (HI))- 


model 

I 

II 


III 

bn/ a 

5% 

10% 

5% 

10% 

5% 

10% 

0.025 

19.15 

31.55 

30.45 

44.9 

2.45 

4.95 

0.05 

00 

bo 

16.9 

15.2 

25.9 

5.4 

8.7 

0.075 

5.7 

13.1 

10.25 

18.75 

7.55 

11.9 

0.1 

6.05 

12.8 

8.85 

16.3 

8.15 

12.7 

0.125 

4.75 

10.95 

7.95 

15.2 

7.4 

12.15 

0.15 

5.55 

11.9 

7.3 
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5.2 Change point tests for a lag /c-correlation 

We now investigate the same properties of the tests for changes in the lag 1-correlation. For this 
purpose we consider the following models. 

(IV) G{t,Ti) = (^-0.5)V4, where H{hTi) = 0.377(f, + £*. 

(V) G{t,iFi) = Hit, dFi)^c{t)/A for t < 0.5, and G{t,Hi) = H{t, Hi)\/d{t)/A for t > 0.5, where 
c{t) = 1 — {t — 0.5)^, d{t) = 1 — I sin f and H{t, Hi) = 0.3H{t, Hf) + Si. 

(VI) G{t,Hi) = H{t, Hi)^l — {t — 0.5)^/8, where H{t,Hi) = 0.5i7(f, J^i_i) + Si for t < 0.5, and 
Hit, Hi) = 0.7H{t, Hi-i) + Ei for t > 0.5. 

Model (IV) is a locally stationary processes. The variance of the process is time-varying, but 
the correlation remains constant. Model (V) and model (VI) are piecewise locally stationary 
processes, where the variance has a change point. Before and after the jump, the variance varies 
smoothly. The correlation of model (IV) and (V) is constant, while the correlation of model (VI) 
has a break at f = 0.5. Typical trajectories corresponding to these processes are depicted in the 
lower part of Figure [H 
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Note that the change point analysis by the tests proposed in Section 14.11 and 14.21 reqnires the 
choice of two bandwidths in the local linear esti mates of the mean a nd variance. We nse a 
generalized cross validation method introdnced bv IZhou and Wd (120101) to select the bandwidth 
for estimating the mean fnnction. Then we apply this cross validation procednre again to select 
the bandwidth for estimating the variance fnnction. The parameters L and C, in the estimator 
flO.lOp are chosen as L = and ( = 0.016, respectively. 

The corresponding rejection probabilities are displayed in Table [2] for varions bandwidths bn in 
the interval (0.025,0.3). At each fixed bn, the bandwidth for variance Cn is calcnlated by cross 
validation. The resnlts for the bandwidths calcnlated by cross validation are displayed in the last 
row of the table. The left and middle colnmn correspond to the “classical” hypothesis whether the 
correlation remains constant (model (IV) and (V)). We observe a very accnrate approximation 
of the nominal level if the bandwidth is chosen snch that bn < 0.1. If bn G (0.1, 0.3) the nominal 
level is slightly nnde restimated. For both models the generalized cross validation proposed by 


Zhou and Wul (120101) yields a rather accurate approximation of the nominal level. 


The right column of Table |2] shows the simulated type I error of the test (I4.10p for a non relevant 
change in correlation with 6 = 0.2. Again the case 6 = A = 0.2 is displayed in model (VI) 
corresponding to the boundary of the null hypothesis, and the simulated rejection probabilities 
are smaller if |A| <6. Compared to the test for a non relevant change in the variance (see Table 
H]) the test for a non relevant change in the correlation is rather stable with respect to the choice 
of the bandwidth. Also the proposed cross validation methodology performs reasonably well, 
which is reported in the last line of Table |2l 

Finally, we display in Figure[2]the simulated rejection probabilities of the tests for the hypotheses 
(11.3p and (Id.lOp for a non relevant change in the variance or correlation, respectively, as a function 
of the parameter 6 G [0, 2A]. The significance level is chosen as 0.1. As expected the probability 
of rejection decreases with 6 (see also the discussion in Remark [4.ip . 


5.3 Power properties 

In this section we investigate the power of the tests proposed in this paper considering the 
following four scenarios. 

(F) G{t,J^i) = (l(f < 0.5) + l(f > 0.5)\/l + J^i)/4, where A > —1, H{t,Fi) = 

0.5i/(f, J^i_i) + Sj for t < 0.5, and H{t,J^i) = —0.5Ff(f, J^i_i) + Ei for t > 0.5. 

(IF) G{t,Ti) = H{t,J^i)[l{t < 0.5)^/l-a^{t) + l{t > 0.5)v^(2 + A)(l - &2(t))J/8, where 
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Table 2: Simulated Type I error of the tests for a change in the lag 1-correlation for various 
bandwidths and the bandwidth calculated by generalized cross validation (last line). Left and 
middle column: test for the hypothesis fl3.8p (model (IV) and (V)). Right column: test for the 
hypothesis fl4.10p (model (VI)). 
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Figure 2: Simulated rejection probabilities of the test fl4.10p for the hypothesis of a non relevant 
change in the variance (left panel) and lag 1-correlation (right panel) as a function of the threshold 
5 e [0,2A] in the hypothesis fll.3p . 












^ — 0-5, and H{t,J^i) = ^ > O-^ 

function a and b are defined by a{t) = 1/4 + 1/2 and b{t) = 0.5 — (t — 0.5)^, respectively. 


(Iir) G{t,J^i) = (t, J^j), where H{t,Ti) = 0.3H{t, J^i) + e ioi t < 0.5, and 

H{t, Fi) = (0.3 — X)H(t, Fi) + e for t > 0.5. 

(IV’) G(f, Fi) = H{t^ Fi)^l — {t — 0.5)^/8, where H{t, J^i) = {0.5 — X)H{t, for t < 0.5, 

and H{t, J^i) = 0.7H{t, J^i-i) + £i for t > 0.5, 

Model (r) is used to study the power of the test fl3.7l) where the case A = 0 corresponds to the 
null hypothesis of a constant variance. The power properties of the test fl4.7p for a non relevant 
change in the variance is investigated in model (IT). Here we test the hypotheses Hq : A < 1/64 
versus Hi : A > 1/64, where the case —2 < A < 0 corresponds to the null hypothesis. Similarly, 
the power of the test for a constant lag 1-correlation fl3.18l) is studied in model (HI’) (again the 
case A = 0 corresponds to the null hypothesis of a constant correlation) and the corresponding 
hypotheses Hq : A < 0.2 versus Hi : A > 0.2 of a non relevant change in the lag 1-correlation 
are investigated in model (IV’) (here the case —0.4 < A < 0 corresponds to the null hypothesis). 
The rejection probabilities for various values of A are displayed in Figure [3l We observe that 
the proposed methodology can detect (relevant) changes in the variance or correlation with 
reasonable size. 


6 Data Analysis 

Various scientihc data show that since the late 1800s, the mean global temperature starts to 
increase significantly. For example, during 1906-2005, the Earth’s average surface temperature 
rose by 0.74 ± 0.18 °C, with the rate of warming also increasing with time. On the other hand, 
however, there are much fewer studies on possible changes in the second order characteristics 
(especially the correlations) of temperature time series, both globally and regionally. In this 
section, we are interested in identifying possible changes in the second order structures of re¬ 
gional temperature in the recent three centuries. To this end, we analyse the Hadley Centre 
Central England Temperature (HadCET) data from 1659-2015. These data can be downloaded 
from http://www.metoffice.gov.uk/hadobs/hadcet/. We present the analysis results of monthly 
temperature series in January and July as representatives of the winter and summer monthly 
temperature patterns in central England. The time series are shown in Figure 01 There are 
apparent increasing trends in both time series. 
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Figure 3: Simulated power of tests for a change in the variance or lag 1-correlation. Left upper 
panel: test for a constant variance defined in (13.71) (model (!’))■ Right upper panel: test for a 
non relevant change in the variance defined in (14.7p (model (II’)). Left lower panel: test for 
a constant lag 1-correlation defined in (13.181) (model (III’)). Right lower panel: test for a non 
relevant change in the lag 1-correlation defined in (14.211) (model (IV’)). 
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We first test the constancy in the variance and lag 1-correlation for the January data. In our 
analysis the critical values are generated by 8000 bootstrap replications. The bandwidths were 
chosen as described in Section |5l The results are summarized in Table [3l The test (I3.7|l rejects 
the null hypothesis of no change point in the variance at 5% level. We then use the statistic 
fl4.ip to estimate the change point and obtain = 226, which corresponds to the year of 1884. 
Next we apply the test fl3.7p again to the periods before and after the identihed change point and 
conclude that there are no further structural breaks in the variance during the two periods. The 
estimates of the variance before and after the year 1884 are given by 4.05 and 2.85, respectively. 
Next we apply (I3.18p to testing the constancy in the lag 1-correlation, where we use the statistic 
fl3.10p to estimate the change points in the variance with ( = 0.14 and L = 38. We identify 
= 242, which corresponds to the year of 1900. The result is close to the one which is obtained 
by the estimator (14. ip . The null hypothesis of no change points in the lag 1-correlation is rejected 
at 5% level [see Table |3]. Next we use the statistic (I4.12p to identify the location of the change 
point of the hrst order correlation and obtain = 213, which corresponds to the year of 1871. 
Again we investigate the existence of further changes in the lag 1-correlation before and after 
the year 1871 and conclude that there are no further structural breaks in the lag 1-correlation 
during the two periods. The estimates of the lag 1-correlation before and after the break point 
are equal to —0.108 and 0.231, respectively. 

The results from Section 0] enable us to perform tests for relevant changes in the variance and 
lag 1-correlation for the January data. Figure [5] displays the p-values of the tests for a relevant 
change in the variance and log 1-correlation for different values of the threshold S. At the 5% 
signihcance level, we conclude that there exists a relevant change with size S = 0.645 in the 
variance and a relevant change with size S = 0.313 in the lag 1-correlation. 

For comparison, we also analyse the July data. For the variance, we choose the bandwidths 
bn = 0.205 and m = 33. The test statistic (13.ip is 1.90, together with the simulated 90% critical 
value 1.83 and 95% critical value 2.04. Hence we cannot reject the null hypothesis of no change 
point in the variance at the 5% signihcance level. For the correlation, the bandwidths are chosen 
as bn = 0.26, Cn = 0.06 and m = 25. The test statistic fl3.14p is 0.84, and the simulated 90% 
critical value is 1.11, and the 95% critical value is 1.25. Hence, again we cannot reject the null 
hypothesis of no change points in lag 1-correlation at the 10% signihcance level. 

In conclusion, our data analysis suggests that besides the mean trend, there exists strong evidence 
indicating structural changes in the second order structures of monthly temperatures in central 
England. Further, the latter changes are inhomogeneous among diherent seasons, in the sense 
that changes in the variance and correlation are more signihcant in the winter than in the 
summer. This implies that winter temperatures in central England have become more unstable 
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and more difficult to predict since the late 19th century. Finally, we also locate the change 
point in the second order structure of the HadCET temperate data through all three ways we 
proposed in our paper. The three change points, 1871, 1884 and 1900, are quite close. Our 
hndings suggest that the time of change in the second order structure of our data coincides with 
that of the mean global temperature identihed in various previous studies. 


January Temperature 


July Temperature 




Figure 4: Temperature in °C of UK from 1659-2015 in January (left panel) and July (right 
panel). The lines are the fitted trends of the means by local linear regression. 


Table 3: Tests for the existence of a change point in the variance and lag 1-correlation in the 
HadCET data, n* denotes the critical values obtained by the bootstrap procedure. “Whole” rep¬ 
resents the whole period, “Before” and “After” represent the period before and after the detected 
change point, respectively. 



Variance 

lag 1-Correlation 
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Before 

After 
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Before 

After 

Test Stat. 
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1.53** 
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0.71 
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Figure 5: p-values of the bootstrap test for a relevant ehange in the variance (left panel) and lag 
I-correlation (right panel) for different values of the threshold 5. The horizontal line marks the 
significance level 0.05. 
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7 Proofs of main results 

In this section we provide proofs of the main results, where some of the technical details are de¬ 
ferred to an online supplement. Throughout this section the symbol ^ denotes weak convergence 
of a stochastic process in C(0,1) with the uniform topology. 


7.1 Proof of Theorem 13.IL 13.21 Lemma 13.IL Theorem 13.31 and 13.4 

7.1.1 Proof of Theorem 13.11 

In order to study the asymptotic properties of the statistic 7k we introduce the random variable 


T„ = max 

l<i<n 


Si - -S„ 


n 


where Si is the ith. partial sum of the PLS {e^}k=i, that is Si = h follows from the 


results of 


Zhoul (I 2 OI 3 I) that -^Tn sup^g^gp) where {17i(f)}tg[o,i] is a centered 


Gaussian process with covariance kernel 
We will show below that for the maximum deviation between Si and Si satishes 


rncQL\Si-Si\ = Op{\/nb^ + nhl^ + h^^). (7.1) 

l<i<n 

From (17. Ij) we obtain Tn — Tn = Op{\/nhf + nb^ + b~^) = o{^/n), where the last estimate follows 
from our choice of the bandwidth bn- This yields the assertion of Theorem 13. II 

For a proof of the remaining estimate fl7.ip we use the decomposition 




8'- — d 
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where the quantities An,i and Bn,i are defined by 

i i 

^n,i — ^ ^ ^ {f^bn (tj) ~ h'(^i))) Bjii — ^ ^ {^nitj) — i^b„(tj)) ■ 
i=i i=i 

Observing the estimate fl8.4p in Section EH] of the technical appendix we have 0 < Bn,i < Bn,n = 
Op{b~^ + all 1 < i < n, which implies 


max Bn,i = Op{b^ +nb^). 

l<i<n 


(7.3) 


By Lemma 18.11 (which is proved in Section 18.1|) it follows that 

i [nbn\ 


max 

[nbn] <i<n— [nbn] 


max 

n— [nbnj +l<i<n 


An,i 2 ^ ^ Cln,j 2 ^ ^ ~ Op{nXn)i 

j=lnbni+l i=l 

n—lnbn\ [n6nj —1 

An,i — ‘2 'y ^ — 2 ^ ^ ^jiP'bnitj) ~ f^(tj)) 

j=[nb„} i=l 

i 

~2 ^ ^ ^jifibnitj) — fi(tj)) = OpijlXn)) 

j=n-\nbn\+l 


where Xn = 


^n,j 



IL 


U = 


(7.4) 


A further application of the estimate fl8.4p in Section 18.11 and the Cauchy-Schwarz inequality 
gives 


max 

l<t<L’3braJ 


j 

i=l 




max 

n— [nbn\-\-l'^j'^Ti 
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Y1 

i=n—\nb„\+l 


\nbn\ 

< - fibAij)\U 

i=l 

= 0{\/^ + nbl), 

= 0{^/^ + nbD. 


37 




























This implies that 


max < max \An,i\ + Op{^/^ + nbl), (7.5) 


where An,i = 2 X]j=[n6„j is defined in (TTTj) . 

In the following we derive an estimate for the first term on the right-hand side of 07.51) . For this 
purpose we consider the random variables es,m = lE(es|es, ...,es-m) and note that the sequence 
is m-dependent. Now define aj™’ ^ Sj Kb^ (^)es,m/(ni>n) and 


47 = 2 5^ 


Jm) 

^n,j y 


i=[nfe„J 


then a similar argument as given in the proof of Theorem 1 of I Zhod (120141) shows that 

n 

— -){es,m-es) <C^/^r, 

^ 71 ^ A 


max I 
l<j<n I 


■bUix 


s=l 


for some constant x ^ (0,1). By the Cauchy-Schwartz inequality it follows that 


max \An,i - A; 

lnbn\<i<n—lnbn.\ ’ 


With the notations = Cj^m XLi i^)(^s,m/{nbn) and A''^’ = 2 X5=[n6„j i^ i® 
see that 
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n-[nbni n 

— l^il 
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j=lnbn\ s=l 
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(7.6) 


r(m) _ r) 


.(m) 


r (m) A (m) I 


max \^ni - A 

lnbn\<i<n—lnbn\ 


J=1 


— -)es, 

^ 71 


S = 1 


'■bn\ J^s,m 

n 


(7.7) 


Now an elementary calculation via Burkholder’s inequality shows 


max 


nh 


S=1 


s-3 


n 


= O 




and by a similar argument as given in the proof of Theorem 1 of IZhonI (120141 ) we have for some 
constant x ^ (0) 1) fhe estimate maxi<j<„ — e^ld = 0(x”^)- This gives for the left-hand 
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side of fl7.7l) 


max 

[nfe„J<i<n—[nftnj 


_ Am) 


2 




and an application of 07.61) yields 


max 

[n6n J [nbn 




(7.8) 


A tedions bnt straightforward calcnlation shows that Vj-i{ej^rn^i^rn) = 0 for / > 2m. For example, 
a i > j — m, then by dehnition, ej^rn^i^m is a{ej- 2 m,£j- 2 m+i^ measnrable. Conseqnently, 

j—l) if ^ ^ 2??7., which giveS 'Pj—l{^(ij^ra^i,m) 0- 

The other cases i < j — I — 1 and j — m — 1 are treated similarly, and details are 

omitted for the sake of brevity. Observing Vj-i{ej^rn^i^m) = 0 for Z > 2m we obtain 


max 

lnbn\<i<n—lnbn\ ’ ’ 


(m) I 


2m 

1=0 


max 

[nbn\<i<n-[nbni 

j=lnbn} 


E 


A'^)\ 


Similar argnments as given in the proof of Theorem 1 in IWul (120051) show 


(7.9) 


lIPj-io 


(m) I 

n,j I 


2 E 


M 


n 




S=1 


rKr,. 


n 


■'j,m 


E 

S = 1 


p(i-0 m. 


n 


and by the triangle ineqnality it follows that 


\\Vj-ia. 


(m), 

n,j I 


< M(Zij + Z2j), 


where the terms Zij and Z 2 J 



are dehned by 


nbr, 


S=1 


f_ l\ IgO-O _ g 1 

\ys,m ^s,m} 


nbr, 




n 

n 


s=l 




n 


2 


2 


e^iL = E(ei'^^|es_m, e^,E s) for s - m < j < s, = Gi{ts, for k < ts < 6z+i and we 
nse the convention ei{m = es,m for j<s — moij>s. Elementary calculations show that for 
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I > 0 


n 4 


S=1 


while by definition — e^mlU = 0 for / > m. On the other hand, if 1 < j < n, 0 < / < m, 

we have by Assnmption (A4) 


;0--0 

■'j,m 




= llE(ej 


di-O 


^j-m: ■■■1 ^j-h ^j-li 


<Mx\ 


which gives Z 2 ,j = Observing that — es,m = 0ifs>j — / + m + lors<j—/ — 1, 

it is easy to see that Zi^j = O(^). It now follows from Doob’s inequality 


max 

[nfe„J<i<n—[nftnj 


j=[nbn\ 


2 



m 

nbn 


1 


and we obtain from fl7.9p that 


max qiy - EA'Z 

[nbnl <i<n— [nb„J ’ ’ 


Ml 


m 


2 


+ (K 


.- 1/2 


(7.10) 


Finally, similar arguments as given in the proof of Lemma 5 in IZhou and Wd (120101) show 


max 

lnbn\<i<n—lnbri\ 


E[y,„i = o(5;^5;^x'-^V(«w) 

i=l j=l 


o{bA)- 


Observing fl7.5p . fl7.8p and fl7.10p and taking m = Mlogn for a sufficiently large constant M > 0 
yields maxi<j<„ = Op{\/nbn + nb\ + b^)- Consequently, the assertion (17.ip follows from 
(I7.2p . (17.3p and this estimate. 


7.1.2 Proof of Theorem 13.21 

We recall the definition of in (13.Op and define on the interval [0,1] the linear interpolation 

The assertion follows if the weak convergence 
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conditional on Tn can be established. For a proof of this statement dehne and 
by replacing the nonparametric residuals by the (non-observable) errors in the 

dehnition (I3.6p and (17.111) of and respectively. Note that similar arguments as 

given in the proof of Theorem 3 in IZhod (120131 ) show that {<Fm,n(^)}te[o,i] ^ {^i(^)}te[o,i]- The 
assertion of Theorem 13.21 then follows from the estimate 


sup \^m,nit) - ^m,nit) \ = Of 


ie[o,i] 


m log^ n\ 1/2 


nb' 


^12 


+ ■s/mh'^^\ogn). 


(7.12) 


In order to prove (17.121) let C denote a sufficiently large constant, which may vary from line to 
line in the following calculations, and consider the event 

7l„ = I sup |/ife„(t)-/i(t)| < C(—+ 6^ logn]|. 
l te[0,l] ^ ./'nh hJ / 


y/nbj}n 


By Lemma (8.31 of Section IHTT] in the technical appendix we have that lim^^^oo IP(^n) = 1- This 
yields for l<j<n — m-|-l the estimate 


j+m—1 




[6r — e* 


< C 


r=j 

m? log^ n 


j+m—l 

)^ {2er + ir — CrY I {An) 

r=j 


nb, 


3/2 


-I- rn^bn log^ nj. 


Similarly, it follows that E[(S'n — Sn)‘^^I{An)] < C{ ^ ^ 3^2 -|- m^fe^log^n), which gives 

Tlbn 


n—m+1 


\\{^n-m+l-K-m+l)I{An)\\l = — -— V - -{S^ - Sn)f I{A^) 

m{n — m + 1) L' n 

^ ’ i=i 


< C 


m log^ n 

~W~ 


-|- log^ n j. 


An application of Doob’s inequality and Proposition 18.21 in Section [8]2] dually yields 

'vn log^ n \ 1/2 


max = O 


l<i<n—7n-\-l 


nb, 


3/2 


^/mbn log n 


The estimate (I7.12p now follows from this result and dehnition (I7.1ip . which completes the proof 
of Theorem 13.21 
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7.1.3 Proof of Lemma 13.11 


Define AA(i) = ;^ (X]j=i-L+i recall the definition of j\4(i) in fl3.1ip . By 

similar arguments as given in the proof of Lemma 1831 in the technical appendix (note that t > 8) 
we have — A/'(i )||4 = 6 ^ + and Proposition 18.11 yields 

L<.?S+i + -^). (7.13) 

Consider the case that i & B ■= {i ■. \ti — > L}. Then by our assumption on the variance 

function, there exists a large constant C, such that |EA/'(f)| < CL/n for L < i < n —L + 1, i E B. 
By Lemma f8.3l and Lemma 18.41 in the technical appendix it now follows ||A/'(f) — EA/'(«)||i /2 < 
(L<i<n — L + 1 , iGS), which gives 

= Op(L-^/ 2 ^ 2 A + Lin). 

L<t<n—L-\-l,i£B 

Combining this estimate with fl7.13p yields 

max \M{i)\ = + L/n + n^/%1 +-. 

Similarly, we can show that Al(Lnf^J) = cr(f+) — (T{t~) + Op{n^^%‘^ H —577172 + + L/n). 

TL ' Oji 

The choice of L implies that 

lim p(|Af([nfJ)| > nii« |> 1 (*)|) = 1 , 

n—>-oo \ L<i<n—L+l,i£B / 

which completes the proof of Lemma 13.11 □ 


7.1.4 Proof of Theorem 13.31 

We restrict ourselves to the case of a variance function with no change point and the corre¬ 
sponding estimator 03.91) . The statement for the estimator 03.12p follows by similar arguments 
as given in Theorem 14.31 where we deal with the problem of testing for relevant changes in the 
correlation. 

Recall the dehnition 03.15p . dehne Sf' = as the corresponding partial sum and consider 

the CUSUM statistic = maxi<j<„ \SY — We will show the estimate 

max 15,^ -SY\= Op{ncl + nh\cl^l^ + (7.14) 

l<2<n 
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which implies f^ — T^ = Op{nc^ + nh%Cn^^^ + It follows from IZhod (12013[) that T^jyjn 

converges weakly to the distribution of the random variable /C 2 defined in Theorem 13.31 By 
our choice of the bandwidth bn we have nc^ + = o{y/n), and the assertion of 

Theorem 13.31 follows. 

For the sake of simplicity we omit in the subscripts c„, bn in the variance estimator (Tc„, 6„ and 
the superscript k in the definition Wj^, the proof of the estimate (17.141) . With the notation 






FT) 


we obtain 


max V (Wi - Wi 
l<j<n I \ 
i=l 


< max ^ ^iU+k)\ ^ 


l<j<n 


i=l 


a'^{ti)a{ti+k) 


(7.15) 


where we have used the fact that the variance function is Lipschitz continuous. Let Wi = 
denote the analogue of Wi, where the estimate has been replaced by the “true” variance 

cr^(ti). By a careful inspection of the proof of Theorem 13.11 it can be seen that 


] 

max {Wj - iw) = Op{^/^ + nb\ + b~^). 

l<j<n \ ^ ^ \ / 

“ i=l 


(7.16) 


Define 


^ := E(»'. - Wi) = E 


iiCi+ki-^^iti) + a'^iti)) 


i=l 


2=1 


a'^{ti)a^{ti) 

then our next goal is to estimate maxi<j<„ |Aj|. For this purpose we consider the random variable 


^_ ^iGi+k{—b'‘^(ti) + cr^(tj)) 

2=1 


(T^(L 


(here the estimator in the denominator has been replaced by the true variance function), and 
obtain 


max Aj — Ad < - 

l<j<n 

i=l 


eiei+fc|(a2(fi) - a‘^{ti)f 


a‘^{ti)a^{ti) 


(7.17) 


For the expectation of the right-hand side it follows 
\eiei+k\{^^{ti) - a‘^{ti)y 


E 


2=1 


a‘^(ti)a'^(ti 


< ||ei||4||ei+fc||4||((j^(fi) - a‘^{ti)y\\2. 

2=1 


(7.18) 
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By Lemma 18.31 of Section 18.11 in the technical appendix we have that 


- f^it)\U = 0(bl + (7.19) 
which implies ||ei ||4 < C. On the other hand, Corollary 18.11 in Section IHTTl shows 

™ ||(<j2(ti) - a\ti)y\\2 = Ofe + ^ + 4 + —(7.20) 

\ TtOrfl nCn^ 

and we obtain from fl7.17p . fl7.18p and Proposition 18.11 in Section [ 8 ^ the estimate 


max |Aj| < max |Aj| + max |Aj — Aj\ = max |Aj| + Op{nb^ + + nc^ + c„^) 


(7.21) 


Now the remaining problem is to derive an appropriate estimate for the qnantity maxi<j<„ \ Aj 
For this pnrpose note that Aj = Xj^i + Ajg, where 


i=l 

h. = t- 

2=1 


(eje^+fc - eiei+k){iy^iti) - 
eiei+k{cr‘^(ti) - 




By Lemma 18.11 Corollary 18.11 of Section 18.11 and the estimate (I7.19p it is easy to see that 


E 


max lA^il 


< 


E 




cr^(ti 


E 


max |Aj^ 2 | 

l<j<[nbn+ncn\ 


2=1 

[nb„+ncnj .. .. 

^ Y1 = 0(7r„), 

^ cr (P) 

*=i ' ' 


(7.22) 

(7.23) 


- l^-Fhn+nc„J-l,2|+ 

i=n—lnb„+ncni 




E 


— |-^j, 2 | + / 

[nb„+ncn\<j<n-[nbn+ncn\-l ’ f- —^ Cr^(tj 

i=n—[nf)n+nc„J 

= max |Aj, 2 | + Op(7r„). 

[nb„+ncn\<j<n-[nbn+ncn\-l 




(7.24) 


where the constants tt^ and 7 r„ are given by = nb‘^c^ + ^/^b\ + + ^ 5 =, tt^ = 

{nbn + ncn)ipl + cl + ^ + ^), respectively. 

In order to prove a corresponding estimate for the remaining term mBx^nb„+ncn\<j<n-lnbr,+ncn\ I Ajgl 
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in fl7.24p we study the asymptotic behavior of the quantity a'^{t)—a^{t). By Lemma 18^ in Section 
18.11 it easily follows that 


sup 

tG%7l 


»Ht) - At) - - — E AiU - ml - E(e?)) 

2 nCn 


2 = 1 


= 0{ci + 


nCr 


(7.25) 


We now consider the decomposition 

n n 

Y. rnxti - mi - mi) - (el - ml))) = Y - mi, 


2 = 1 


2=1 


where Qi = Qi,i + Q 2 ,i, Qi,i = 2ei[fi(ti) - Q 2 ,i = [/i(fi) -/i(ti)]^. By LemmaEIIlin Section 
18.11 we obtain 


sup 

[nbnl <i<n— [n.b„J 


P'bAti) - Kti) - ^ ^ ejKb„{tj - ti) = 0{hl + ^). 


i=i 


The triangle inequality and Proposition 18.11 in Section 18.21 imply 


sup \ ^KcSU-t) [Qi,i 
iST" I 


2ei 

nbr, 


'y ^ ^j) 


i=i 


fi2tl{ti)blei] = Oinblcl^^), 


(7.26) 


where we use the notation T" = [bn+Cn, 1 ——c„]. Similar arguments as given in the calculation 
of mBx^nb„\<i<n-lnbn\ \^n,i\ In the proof of Lemma rO and the summation by parts formula show 


2 

sup — 
tex" nbn 


y ^ ^Cn (ti t)ej ^ ^ CjKfj^{ti tj) 
i=l j=l 


0{b-^), 


sup 

iex" 


n 

y ^ -t^c„(tj — t)/r2/l(ti)6„ej 
2=1 




and (|7.26p gives || sup^g^;^ | Ym=i ^bAU — t)Qi,i |||2 = 0{jib\(M^ + b^^ + n^^'^bl). On the other 
hand, note that 


sup 

i6X" 


Kc^{ti - t)Q2,i 

2=1 


^ Rn,l H“ Rn,2 
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where 


R-n,l 


sup 

‘6^" i=l 


^c-n(tj ^j^b„{tj ^i)+ 


fJ‘2fj‘{ti) t 2 ^ ^ 

o 


j=l 


-Rn ,2 — 


sup I - t) U{ti) - fi{ti) + ^ ^ ejKb^iti - tj) + 


t&' 


i=l 


j=l 


X (^jj,{ti) ft{tl) ^ ^ ^j^bn {tj tj) 


t^2t^{ti) 


i=i 


Proposition 18.11 in Section 18.21 and similar calculations as given in the proof of Lemma 17.11 show 
that 


Rn,i = o(nCnC^R^{^ + 6^)) = + ncl/%1), 


while a further application of Lemma 18.11 in Section 18.11 yields 

nb^ 


= O + nbyf-) = 0(yibrcy + nbyf) 

Consequently, combining the arguments in fl7.25p - fl7.27p . it follows that 
a\t)-a\t)- 


(7.27) 


sup 

t&V' 


2(+ \ 2 ^ 

-=OU»). (7-28) 

^ TI/Cyi ' 


i=l 


where 7r„ = 4 + ^ + ^-f Cn ^ + bn^\ncn) R^ + blcn^^^. 

Recall the definition of hR, define Z' = — Ee^, then it follows from fl7.28p that 


E 


max 

[nbri +ncn\ <j<n— [nfe„ +nc„J 


T Kc^tj - ti)Z[ + ^2^^{ti)ncl/2) 

^i-2 + 2^- 

2=1 


a^{ti)ncy 


< 


/ ^ 11 ^ 2 ^ 2 +/^ II 2 

i=\nbn-\-nCn\ 


cr‘^(L 


(7.29) 


= 0(n7r„). 

By the Cauchy-Schwarz inequality we obtain ||lRi — lU = 0(x'*~”*')) W^i ~ = 
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(m) (m) 


where - E(efV, w/’") = and 

eS™) if 6, <t. <6,+i. 


Hence, with similar arguments as given in the proof of Lemma 5 of IZhou and Wul (1201011 we get 


max E 

[nb„+nc„J<j<n— [ubn+ncni 


i=[nbn+nCn\ 


a‘^{ti)nc„ 


= 0{c-^). 


Then by a similar m-dependent approximating technique as given in the proof of Lemma [7.II we 
get 


max 

lnbn+nc„\<j<n— [nb„+ncni 


^ - ti)Z’ - EUpA'^Ji, - A)Z'l 

i=lnb„+ncn\ 


a‘^{ti)nCr, 


= Op{c^^] 


Similarly, and more easily one obtains 


j 

. , y^eiei+fc/i2cx2(ti)62/(2a^(t.)) =Op(y^c^). 

\nbn+ncri\<J<n—\nbn-\-ncn\ I —f 

1=1 


(7.30) 


Hence, it follows from (17.2911 and (I7.30P that maxLn6„+„c„j<j<n-Ln6„+nc„j |Aj- 2 I = Opin-nn + ncl), 
which implies, observing (I7.22p - (17.2411 . 


max lAJ = OpiiTn + 7r„ + min + ncl). 

lnbn+ncn\<j<n—lnbn+ncn\ 

Combining this result with the estimates (I7.15p . (I7.16P and (17.2111 . and by our choice of the 
bandwidths, we have that 


max 

l<j<n 


- Wi) =Op{ncl + 


2=1 


which establishes the estimate (I7.14p and completes the proof of Theorem 13.31 


7.1.5 Proof of Theorem 13.41 

We restrict ourselves to the case of a variance function with no change point and the corre¬ 
sponding estimator (13.911 . The statement for the estimator (I3.12p follows by similar arguments 
as given in Theorem 14.41 where we deal with the problem of testing for relevant changes in the 
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correlation. 

Recall the definition of Wi and Wi in fl3.13p and fl3.15p and introduce the notation PR = 
(again the superscript is omitted in our notation). We consider the corresponding partial sums 
S,,m = ~S,,m = Y^rt7~" and S^m = ^ud define = Er=lWr, 


+ fc 
) 


Sn = Er=i^r, = Er=i^>- Similarly, define <hm.n(P)), {^Im, and 

by replacing {Sn, 4',m) in the definitions ([SS]), dZHH) of <1)*,^ and by (5'„, 


{Sn, Sj^m) and {S^, S^m)^ respectively. It then follows from the results Zhou ( 2013 1 that 
{‘hm,n(t)}te[o,i] ^ {^ 2 (t)}te[o,i], where {U 2 {t)}t£[o,i] is a centered Gaussian process with mean 0 
and covariance kernel fl3.16p . The assertion of Theorem 13.41 is now a consequence of the estimate 


sup |<hm,n(P) - ^m,n{t)\ = Op{^/m6n), 

i6[0,l] 


(7.31) 


where 6n = {(^ + {~^^ + logn. For a proof of this estimate we consider the 


' i/nCn 


\/nhn 


events 


yl„ = I sup |/ib„(t)-/i(t)| < G +G&nlogn!>, 

^ t6[0,l] VnOnOn ^ 

Bn = I snp \a^{t)-a‘^{t)\<c(cl + {^^ + bl + ^=)c-^^^]\ogn.V 
t te[o,i] ^ ^/ncn \/nb„ f > 


then by (18.Sp and Corollary 18.21 in Section l8T] it follows that hm„^oolP(^n H Bn) = 1. Using 
similar arguments as in the proof of the estimate (I7.12p it can be shown that 

11(4:;,™ - 4„,™)/(A„ nB„)i |2 - '^((?)' 

||(4;„ - l„,„)/(/!„ nB„)f < Cmdl 


The assertion of Theorem 13.41 now follows by a similar argument as given in the proof of Theorem 

[321 □ 


7.2 


Proof of Lemma 14.1! - 
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In order to simplify the notation define Gn{m) = S^ — ^Sn, Gn{'m) = Sm — ^Sn, where 
Sm = YTiLi ^rn = Y7d=i ^7 Then it is easy to see that the estimator in of the change point 
in the variance function defined in (14.ip can be represented as in = ^ argmaX]^<^<„(G„(m))^. 
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Similarly, we introduce the notation 


Gn{m) = 


^j^j+k 


i=i 


a 


2 * 




10 ^ ^ 

j=i j' 


and obtain the representation argmaX]^<^<„(Gn(m))^ for the estimator of the change 

point in the correlation function dehned in fl4.12p . 


7.2.1 Proof of Lemma 14.11 

Recall that Section I4?T] considers the problem of testing for a non relevant change in the variance 
and that under null hypothesis, we have ct^(s) = cr^ for s < iy and cr^(s) = cr^ + 7 for s > 
where 7 is an unknown (without loss of generality) positive constant. A simple calculation shows 
that 


fn{m) := EGn{m) = n{m{n)t{n) — m{n) A (7.32) 

where we used the notation m{n) = m/n and t{n) = [niy\/n. By Proposition 5 of IZhoul (120131) . 
on a possibly richer probability space, there exist i.i.d standard normal variables, say {Rijiez, 
such that 


ZhonI (1201311 


max 

l<i<n 


i 


Op(n^/'^logn), 


(7.33) 


j=i 

where Ki is dehned in assumption (A5). By the arguments given in Section 17.1.11 we have 


max \Gn{rn) - G„(m)| = Op(pn), 

l<m<n 


(7.34) 


where p„ = b ^ + n bt + y/nbn- Now a similar reasoning as given in the proof of Lemma 5 of 


Zhou and Wul (j 2010 l) . Assumption (A3) (A4) and (A5) yield that there exists a constant G such 
that Ki{s) < G for all s G [0,1]. Then it is easy to see that ||S „||2 = 0{n). By Doob’s inequality, 
we have that 


max |S,| = Op(\/n), (7.35) 

l<j<n 


49 





















and observing fl7.33p we obtain 


max \G^{m) — G^(m)| = max |G„(m) + (j„(m)||G„(m) — = Op{nQn). 

Km<n l<m<n 


Define Vn{m) = G\{m) — G\{\nt\), note that Vn{\nt\) = 0 and consider a constant a G (|, |), 
snch that /Qn —)■ cxo. Observing the definition fl7.32p and the estimate (I7.33p . it follows that 


max 

l<m<n 




logn). 


(7.36) 


By (17.351) . we have maxi<m<n(Sm — ^^n)^ = Op{n), and together with 07.341) and 07.36|) this 
yields 


max Vnim) = mi« [Gl{m) - Gl{[nt\)] + Op(np„) = max |/^(m) - fni[nt\) 

m£Mn meMn meMn t 

+ 2{fn{m) - fni[nt\))Em + 2fn{[nt\){Era - ^[nt\) 

- 2—fn{m)En + 2^^fn{[nt\)EA + Op{nQn + n^/^logn), (7.37) 

where the maxima are taken over the set = {m \ \ni^\ — < m < [_ni^\ — |}. 

Observing the definition of fn{fTi) in 07.32p we have for some positive constant O, 

max [flijn) - fl{\nt\)] < -Gn^-^, (7.38) 

TTIG-A/Itt. 

and (I7.35P implies 

max [fn{m) - fn{\nt\)]E^ = Op{r?l‘^~°'l‘^ logn), (7.39) 

mGAin 

max [—/„(m) - -^^/„([nfJ)]S„ = Op{r?l'^~°'l‘^ logn). (7.40) 

meMn n n 

Using the representation = Ylt^L+i similar argnments as in the derivation 

of 07.351) yields 

max [Em - = Op(nB2(i-“/2) 

m£Mn 

Conseqnently, 

max fn{[nt\)[Em - S^np] = Op(n^/^""/'^logn). (7.41) 

meMn 


50 























By our choice of a, fl7.37p - (17.411) it now follows that 


P( limsup max _ Vn{m) = —cxo] = 1. (7.42) 

On the other hand, similar arguments give the estimates 


l<m< lntv\ — 

max fn ( lnt \) - S ] 

l<m<Lnt„J-Lnl-“/2J 

inax [—fn{m) -)].=.„ 

l<m< Tl Tl 

max [fn{m) - fnilnt\ )]S^ 


= Op(n^/^logn), 

= Op{n^^^\ogn), 

= Op(?7,^/^ logn), 


and by our choice of a we obtain P(hmsup„^oo maX]^<^<l^„j^j_l^^i-a/ 2 j Vn{m) = —oo) = 1. Com¬ 
bined with fl7.42p this gives P(hmsup„^oo maX]^<^<l^„£^j_l^„i-aj Vn{m) = —oo) = 1, and it can be 
shown by similar arguments that P(limsup„^oo max|^^j^j_,_l^„i-c«j<,„<„ = —cxo) = 1. Conse¬ 

quently, it follows that 


lim P(|nt„ — \niy\ \ < ") = 1, 

n—^oo 

which proves part (ii) of Lemma 14.11 For the case that the variance has no jump at time t, 
the desired result follows from the fact that Gnijn)/^/n converges weakly to some Gaussian 
process {tli(s) — st/i(l)}se[o,i], which implies —> T = argmax^g^Q^^j l^i('S) — st/i(l)|, where 
the Gaussian process {t/i(s)}sg[o,i] is defined in Theorem 13.11 


7.2.2 Proof of Lemma 14.21 

By a careful examination of the proof of Theorem 14.31 (see the next section) it follows that 
max |G'n(m) - (5n(m)| = Op{ncl + logn -h b~^c~^), 


l<m<n 


where 


Gn{m) = 


m 

\ '■ ^j^j+k 




m 

n ^ 

i=i 
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Write + b~^c~^ + logn, where a is defined in the proof of Lemma HTTl 

Let 1/2 < a' < 2/3, such that j —?• oo. Then using similar arguments as given in the 
proof of Lemma 14.11 we can show that 


in -tn = Op{n 

if there is a change in the correlation at time t. On the other hand, if there is no change in 
correlation, we have that tn —> Tmax, where T^ax = argmax^g^g \U 2 it) — tl72(l)|, and the 
stochastic process {t/ 2 (s)}se[o,i] is defined in Theorem I3.31 


7.2.3 Proof of Lemma 14.31 

Recall the definition of (| 4.17p and define An,i = Y}^=i ^n ,2 = Yiij=int\+i (^^e 

superscript k is again omitted). From the proof of Theorem 13.31 we have that 



Since A = E[A 2 ] — E[Ai] we have A„ := A „^2 — A„_i = A + Op{l/^/n). In order to prove this 
estimate we introduce the notation An = {\in — 1\ < Then by Lemma [4.21 we have that 

lim„^oo E(v4,„) = 1. This yields 


(A„4 — An^i)I{An) — I{An){An + + O^), 


where 


\nt\ 


w. 


[nir, 


Wi 


[nir, 






w, w. 


^ V[ntJ [nt\ 


[nir, 


. E 


W, Wi 


^ VLntJ [nin\ 


(7.43) 
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It is easy to see that I {An) An = 0p{^^^). Using the same arguments as in the proof of Theorem 
14.31 we obtain I{An)Bn = o{^/n/n) = o{l/^/n) and 


I{An) ■ Cn 


[nt\ 

/(A) li'y 

i=i 


[ntj — [ntj 
[nfj \ ni\ 


< C ■ I{An) 




i=i 


logn 

riy/n 


\ni\ 

c./(A)(E‘n+o.(v^J))^ 

i=i 

\nt\ [n£j [ntj 

I{An) ■ E Wjl{t < i) — fUj/(t > f) + Op(\/n)j 

i=i j=L«iJ 



logn 

n^/n 


(7.44) 


Combining (17.43^ - fl7.44p and using Proposition 18.21 in Section lOl shows A„^i —= Op(^^)- 
Similarly, we have An ,2 — An ,2 = 0p{^^{^), and the assertion of the lemma follows. 


7.3 Proof of Theorem 14.31 and 14.4 

7.3.1 Proof of Theorem 14.31 

We consider the non-observable analogue 

of the statistic dehned in fl4.22p . where the process Un is given by 


[nsj 

Un{s) = -Y^ 


--E 

-n ( ^ 


^j^j+k 


n “ a{tj)a{tj+k) n ^ a{tj)a{tj+k) ' 
It follows from the proof of Theorem 13.31 that 


{y/n{Un{s) + {s At- st)A)}s^[o,i] ^ {U 2 {s) - sf/2(l)}se[o,i], (7.45) 

whenever A 7 ^ 0. The continuous mapping theorem, elementary calculations, and the identity 
3 — s = f^(l — t)^ imply ^/n{TJ{ — A'^) ^ 2 (A), where the random variable 2 ^ 2 (A) 


53 



























is defined in Theorem 14.31 Finally, we show the estimate 


- Tn) = Op{n^/‘^cl + + n ^/^Cn + n (7.46) 

where C„ = n^““ + nc^ {n~^Cn^l^ + + (:^ 7 = + , a = 1 - 4/i -for some u > 0, 

which completes the proof. 

For a proof of this remaining estimate we consider the case A > 0, where it follows from Lemma 
13.II that |t* — iy \ = Op{n~°'). The proof of the statement in the case A = 0 (which corresponds 
to in Tmax) IS easier and omitted for the sake of brevity. For the event An = {\tn — iv\ A 
it follows from Lemma [3.11 that lim^^ P(A„) = 1. Observe that we have from Lemma [8.41 


sup 


' 2 * 








1 

A 





where Tn = [cn,tv — n “ — c„] U + n " + c„, 1 — Cn]. Also fl8.9p still holds. Now similar 
calculations as given in the proof of (17.141) yield 


max 


max 

[n{iv—n-°‘—Cn)}<l<[n{iv+n-°‘+Cn)} 


7 ^j^j+k 

^j^j+k \ 

\ ^ \a{tj)a{tj+k) 



E 

j=[n{ty -n-°‘ -c„)\ 


I (An) = Op{ncl + nblcn ^), 

I{An) = Op(Cn), 


a{tj)a{tj+k) ^n{tj)y 


max 

ln{iv+n-<^+Cn)\<l<lns\ 


E 

j=ln{tv-\-n "+Cn)J 


^ ^3 \ 


where we used Corollary 18.31 for the second statement. So we have 


I{An) = Op{ncl + nblcn^^"^ + b^^Cn^), 


sup n\Un{s) - Un{s)\I{An) = Opijicl + ublCn^/'^ + + Cn) ■ 

0<S<1 

Using the same arguments and Proposition 18.21 in Section 18.21 we obtain 


sup n\Un{s) - Un{s)\ = Op{ncl + 
0<s<l 
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From fl7.45p it follows that \Un{s)\ds = Op{l). Consequently, we have 

/ [Unis) - Uni^)]ds < sup - t/„(s)| [ \Un{s) + lfn{s)\ds 

Jo 0 <S <1 Jo 

< 2 rd-U sup - Un{s)\ [ \Un{s)\ds + sup \Un{s) - Un{s)\^ 

0 <S <1 Jo 0 <S <1 

= OpinU^cl + nU'^blc-U‘^ + n-U^Cn + 

and the remaining estimate fl7.46p follows. 


7.3.2 Proof of Theorem 14.41 

Recall the dehnition of A„, Aj, in fl4.17|) . fl4.18p . (14.191) and dehne 

A, = ' - AI(j > \nt\), 


(|)^ = 
2 ,m 


(j(tj)(7(tj^jf) 
1 


n—m +1 


E 


■Y/m(n — m + 1) 

where = YA^=i ^r- We introduce the processes 

= ^tns\,m + i^S " M ) 

= ^tns\,m + i^S - M ) + l,„j “ <^fnq,m)- 


(7.47) 


Zhoul (I 2 OI 3 I ) that {<F;^„(s)}s6[o,i] ^ {[/ 2 (s)}s 6 [o,i] conditional on 


and note that it follows from 
J-n- The assertion of Theorem 14.41 is therefore a consequence of the estimate 


sup |<h;^,n(s) - = Op{^/^Pn), 

se{ 0 ,i) 


(7.48) 


where = [n "cn^^'^ + c^ + (^= + 6^ + ^=)cn^^'^) logn. To show (|7.48p . dehne .4.„ = {\in-t\ < 

' v/ >^^Ti v TiOrj, ^ 


1 


s^}, e„ = {|A„ - A| < S^}, C„ = {sup,„„,„ - m < + CPJosn], 


/nc„ n Pnbn 

■,2 


F>l,n = \ sup |(T2*(t) - CT^(f)| < CPn [, 

U2,n = { sup |cr2*(f) - Cr^(f)| < Cpn [>, 

tep„+n-“,l] 
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where C is some sufficiently large constant, a is defined in Corollary 18.31 By construction it 
follows lim„^oo IP(>Vn) = 1, where Wn = Ani3„nCnnPi,„nP2,n- Let Aj = > [nt\). 

Similarly to the definition (I4.19p and (I7.47p . we define S:^^, 4*^^) where the random 

variables Aj are replaced by Aj. From the proof of the estimate fl7.3ip it follows 


sup 

* 6 ( 0 , 1 ) 



(7.49) 


On the other hand, 


El( A. - StJ^HWn)] 

j-\-Tn—l j+m—1 

=e[ ^ /(>V„)(A(/(r > [ntj) - I{r> [_nin\))) + ^ /(>V„)(A - A)J(r > [riAj) 

r=j r=j 

jf+m —1 

<Ce[ ^ /(>Vn)(A(J(r> [ntj)-/(r> LniJ))) 

r=j 



Note that J(W„)(A(J(r > [ntj) — I{r > [nin\)))]^ = 0 if j < \_nt — C\ogny/n} — 

m + 1 or j > [nt + Clogny^J, and is bounded by if j G [max(l, \nt — Clogn-^nJ + m — 
1), min(n, [nt + C \ogn^/n\)]. Thus we have 


n—m+1 


(n — m + 
and similar arguments lead to 

1 


i l?7l 

j = l 


n 


n 


(7.50) 


n—m+1 


in — m + ^ 

^ ^ 7 = 1 


Y. E - str-m,. 


m 


j 


< 


Cm log^ n 


n 


(7.51) 


Now similar arguments as given in the proof of estimate fl7.31l) together with fl7.50p and fl7.5ip . 
yield 


sup 

* 6 ( 0 , 1 ) 


= Op 


mlogn\ 


and the assertion fl7.48p follows from fl7.49p . 
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8 More technical details 


8.1 Uniform bounds for nonparametric estimates 

The following two lemmas provide uniform bounds for the estimate fib„ in the interior T„ = 
[bn, 1 — bn] and at the boundary = (0, bn] U [1 — bn, 1) of the interval [0,1]. 

Lemma 8.1. If assumptions (Al)-(A3) are satisfied and bn —)■ 0, nbn oo, we have 


sup 

tG%n 


P'bni't) - Kt) 




1 

nbn 


n 


^eiKb„{h-t) =0{bn + —). 


where ‘In = [bn, I - bn]- 
Proof. With the notations 


1 t — t 


2 = 1 


1 

nbr 


n 


i=l "■ 


{I = 0,1,...) we obtain the representation 

PbM 

bnfibnit) 

for the local linear estimate where the last identity dehnes the 2x2 matrix Sn{t) and the 
vector Rn{t) in an obvious manner. By elementary calculation and a Taylor expansion we have 


Snfiit) Sn,lit) 

— i 

Rnfiit) 

Sn,lit) Sn,2it) 


1_ 


S-\t)Rn{t), (8.1) 


Sn{t) 


Pb„{t) 

bniPbnit) - Kt)) 


i Er=l eiKbfiU -t) + 

Tk^l,e.KbAP-t)C-f^) 


+ 0{bl + bn/n) 


uniformly with respect to t G In- Note that Sn,o{t) = l + 0(^) and Sn,i{t) = O(^), uniformly 
with respect to t E In, which yields 


sup 


Pbfit) - frit) - _ ^J2eiK,^iti -t) =0{bl + ^). 


2=1 


Therefore the lemma follows from the definition of the estimate fibn in 


□ 
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Lemma 8.2. Assume that the conditions of Lemma \8. 1\ hold, then 


sup 

tel' 


- Kt)) eiKb^{ti-t) + 

^ i=l ”■ 

- ^i,b„(^)^3,fe„(i)) = 0{hl + ^), 


where = [0, 6 „] U [1 - 1], Uj,bAt) = !-t/hY x^K{x)dx and c{t) = iyo,bAt)j^ 2 ,bAt) - ^^IbA^)- 


Proof. For any t E [0, 6 „] U [1 — 1], using fIS.ip . we obtain 


i Er=i[^i - pit) - pit)iti - t)]KbAti -1) 
7k: Er=i[^i - pit) - Kt)iti - t)]Kb„iti - t)C-j^) 

and a Taylor expansion yields 


Snit) 


Pbnit) - 
bniPbn it) 


pit) 

-Kt)) 


Snit) 


Pb„it) - pit) 


nbn Ei=l 

[U -() + 

kiPbnit) - pit))_ 


Ei=l ^i^bniti 



+ 0{bl + bn/n) 

( 8 . 2 ) 


uniformly with respect to t E [0, bn] U [1 — 6 ^, !]• On the other hand, uniformly with respect to 
t E [0, bn] U [1 — bn, 1], we have that 


Snit) 


^o,bAt) 

^hb^t) 


^l,b„it) 

^2,b„it) 




Therefore, combining fl8.2l) and fl8.3p . it follows that 


(8.3) 


1 +. _ f 

cit)iPbAt) - pit)) = ^Y [^^,bnit) - l^l,bAt){k—)ki^bAti - t) + 

2=1 

^pit)klbAt) - ’^i,bAt)j^ 3 Kit)) + o(bl + 

uniformly with respect to t G [0, U [1 — 1]. □ 

The next lemma concerns the order of deviations of from p in the || ■ || 4 -norm. 

Lemma 8.3. Assume that assumptions (Al)-(Af) are satisfied and that nb^ —)■ 00 , nb^n 0; 
























then 


sup Wf^bAt) - t^{t)h = 0{bl + {nb^) 
te[o,i] 


sup l/iftjt) -p(t)| 
te[o,i] 


0(ti + 


(8.4) 

(8.5) 


Proof. Observing the stochastic expansion in Lemma [HU] we hrst evaluate || YAi=i ^i^bniP —1)||4 
uud 11^ YAi=i ^i^bnifi ~ ^)I|4- Recalling the definition of Vi we note that 


-t) 

i=l 


EE sE E Vi—k^i^bn (fi t') 


i=l k=0 


k=0 i=l 


Since for each k, Vi-keiK^Ati — t), lE^Enisa martingale difference sequence, it follows from 
Burkholder’s inequality 


^ ^ Vi—kOiKhn ifi 

i=l 


t) .<c\\('£{Vi —k^i^bniti ^)) ) 


2 \ 1/2 


2=1 


2 

4 


n n 

< CY,\\i'P^-ke^K,At^-t)f\\2 = CY,m-ke^K,At^-t))\\l 

2=1 2=1 


and condition (A4) implies || YAi=i'^i-k^i^bA'ti ~ ^)||4 = 0{AnbfxAi uniformly with respect to 
t e [0,1]. This yields 


sup [[ y^^ejKbAtj - t) 

i6[0,l] " i=i 


0{^/nIA)- 


( 8 . 6 ) 


Similar arguments show sup^gjo^i] ||^ YAi=i ^iKbA^i~^)\\i — 0{\/nbJ)A), and by Proposition 18.1! 
in Section 18.21 it follows that 


II sup I -t)/(n6„)III4 = 0((n6„) ( 8 - 7 ) 

te[o,i] 

and by Lemma [8.11 we obtain 


sup 

tG^n 


(hM - - (- 1 E- *) + 


2=1 


2 ^ Anhfb^^'^ 


Xn I 2 1 

+ Xn h 


where Xn = b^ + bn/n. Hence || sup^g.j^(t) — p(f))^||2 = 0(^72 + ^n)- Ry similar argument 

^ 22677 , 
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and Lemma 18^ it follows that || sup^gy ~ /^(^))^l |2 = 0 (—172 + and a combination of 

the last two estimates gives fIS.Sp . On the other hand, Lemma [8.II and fl8.6p also imply 


sup 




i=l 






1 


which further yields sup^g^ Il(h6„(^) “ /^(^))^I |2 = 0{{-7= + b'iy). Similar arguments show the 
estimate sup^g^/^ IKAbn(^) ~ /^(^))^I |2 = —I" bn)"^), which proves the remaining estimate 

M . " □ 

The following results give a uniform bound for the p-mean of (T^(f) — cr^(f), where is 

the variance estimator dehned in fl3.9p . We begin with a uniform asymptotic stochastic expansion 
for the difference d^(f) — cr^(f). 

Lemma 8.4. Suppose that Assumptions (Al)-(A4) are satisfied, Cn 0, ncn —)■ 00 , and the 
variance function is strictly positive, twice differentiable with a Lipschitz continuous second 
derivative d^. Then the estimate d^ = d^^ defined in fl3.9p satisfies 


sup 


a\t)-a\t) 


fi2(j {t)c„ 


n 


— y{ei-W.ei)K^SU-t) 


nCr 


i=l 


= 0 


Cn 

+ — 
n 


(8.8) 


sup 

tex' 


c{f){a‘^{f) -a'^it) -^ 

nCn ^ L 


ti -1 


le^,-E(el)]K,JU-t)+ 


Z =1 


- 0,c„(t)z^3.c„W) = 0(^4 + ^), (8.9) 


where c{t), r'yc„(^) cire defined in Lemma lKR = [c„, 1 — c„] and = [0,c„] U [1 — c„, 1]. 
Proof. Following the argument given in the proof of Lemma 18.11 we have that 


Sn{t) 


{a\t)-a\t)) 
Cn(d^(t) - a‘^{t)) 


^ - t))Kc„{u -1) 

Er=i(e? - - a^itfiU - t))C-^)K^^{U - t) 


where Sn{t) is defined in the proof of Lemma ISTl The lemma now follows by the same arguments 
as given in the proof of Lemma [8. II and Lemma [8.21 which are omitted for the sake of brevity.□ 


Corollary 8.1. Suppose that the conditions of Lemma 8.4 hold with l>S, then 


sup ||d^(t) — (J^(f)||4 

i6[0,l] 


o ci + 


1 r.2 1 

+ bl + 


yjnc. 


^/nbr, 
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Proof. By flS.Sp . we have for some large constant C, 


snp \\a‘^{t) - a^{t)\U < Ccl + snp -- ^el)KcSU - t) 


tG.'Jr, 


t£Tn 


i=l 


< 


Cc„ + snp 

t&Tr, 


1 

nCr, 


+ snp 
4 teTn 


- Ee?)/C.((, - t) 

i=l 

It is easy to verify that the hrst term satishes 

n 

-5^(e- - Ee^i)Kc^{ti - t) 

ncn ^ 

1=1 

By the proof of Lemma [8.31 we obtain (note that t > 8) 

sup ||/ib„(t) -hWlls = o(bl + 

te[o,i] ^ 


nCr 


5 ^( 6 ? -t) 


i=l 


snp 

teTn 


= O 


y/nc^ 


y/nhf/' 


which yields (note that Cj = Cj + /i(L) — Pb^iti)) 


snp 

t&Tn 


1 

nCn 








Hence snp^gj^ l|u^(f) ~ u^(f )||4 = 0{c^ + -y= + 6^ + :y=^)- Similarly, nsing the estimate fl8.9p 
we obtain snp^g^-/ ||(T^(f) — c’'^(f )||4 = 0(c^ + + &^ + :y^)) which completes the proof. □ 


Corollary 8.2. Suppose the conditions of Lemma 8.f hold, with l> 8. Then 


snp |d^(f) — (T^(f)l 
* 6 ( 0 , 1 ) 





Proof. The lemma follows from Proposition 18. II in Section |821 the triangle ineqnality and simple 
calcnlations. Note that the hrst assnmption of Proposition 18.11 is satished by the argnments 
in Corollary 18.11 The second assnmption regarding the derivative can be shown by similar 
argnments as given in (18.6p and (18.7p . □ 

Corollary 8.3. Suppose that the conditions of Corollaru \8. 21 hold. Let a = l — A/i — v for some 
u > 0, and l is defined in condition (A3) and condition (Af). Then we have 


snp Wnit) - U^(t)| = Op{pn) , 

*e[0,4„-n-“]U[l„+n-“,l] 
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where pn = n ^Cn'^ + cl + + bl + 

Proof. Define 


ok _ 

^n,l — 


T ^ ^r) 

1=1 


ti - t\i 




2=1 


ti - tP 


^nl = - E 

’ nc„ ^ 


ti-tP 


i=k 


wju-t), Ri^ = — j2 

nc„ ^ 


ti -1 


KcSti-t)el, 


K^^{ti-t)el 


i=k 


Recall the definition of cr^*{t) in fl3.12p . With the notation k* = [nt*J elementary calcnlations 
yield 


( 8 . 10 ) 

52(«) ■= 52%-+i(«) = (Co'''(«)SSy‘(i) -Ct‘(i)Ct'(«))/Ko"‘(*)Sh'"'(«) - (SS:t‘(«))")- 


We will show below that 


snp snp \^lh{t)-(jl{t)\=Op{pn), 

\iv—th\<n~°‘ te[0,tt,—n““] 

snp snp = Opipn)- 

\tv—th\<'n~°‘ te[tt,+n-“,l] 


The assertion of Corollary 18.31 then follows since f* — = Op{n “). For a proof of the two 

remaining estimates we assnme withont loss of generality th > ty and define 


= —(E ' 

nCn V ^ 

2=1 




KcAU-t)el+ ^ 

i=\niv+l\ 


ti-tP 


Kc„{ti-t)el 


where + 1 < f < th} are independent of G Z} with bonnded Lth moment, 

E(ej) = 0, Var(ej):=d^(fj) snch that a^{ti) are Lipschitz continuons in and cr^(f+) = 

). Define 

Su.(«) = (K%(*)st2(«)-Kb(<)sti(‘))/(sto(*)s;,2(0-(s;,,(())")■ (8.11) 
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Using similar arguments as given in the proof of Corollary 18.21 it follows that 


sup sup \al^{t)-al{t)\ = Op(cl++ + (8.12) 

+ ^ ^ V \/nOn‘' ' 

On the other hand, by Proposition 18.11 we can show that for / = 0,1 

sup sup 

\niv\ + te[0,l„—n““] 

Combining (I8.inp and (18.111) . the last estimate implies that 


sup sup -dl{t)\=Op{n 

[nivl <h< [ni-ul + [nl““ J te [0,t„ —n““] 

By fimi) we have 


sup sup Wlhi^) - = Op (pn), (8.13) 

\nty\<h<\niv\ + \n^-°‘\ 

and it is easy to see that 


sup sup = Op{pn) 

sup _ sup \^l,hit)-(Tl{t)\=Op{pn), 

[nivl — [nl““J<h<[ni„JJ te[0,i„—n““] 

sup _ sup - (^ 2(^)1 = Op (pn). (8.14) 

— \<h<[niv\ t&[t-u+n~°‘,1] 

The assertion now follows from fl8.13p - fl8.14l) . the definition of cT^*(t), and the fact that — = 

Op(n““). □ 

8.2 Two additional technical results 

Proposition 8.1. Let {T„(t)}ig[o,i] he a sequence of stochastic processes with differentiable paths. 
Assume that for some p > I and any t G [0,1], ||T„(f)||p = 0(m„), ||T„(f)||p = 0{ln), where 
mnjn are sequences of real numbers, mn = Oifn), then 


sup |T„(f)| 
ie[o,i] 


V 





In particular, if p = 2, we have || sup^g^,!] I^n (^)|||2 = 0{^/mflf). 
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Proof. For a sequence bn define bn = [bn and let r* = ibn, i = 1,2,...,bn and r* = 1 for 
i = bn + 1. Then by the triangle inequality, we have 


sup |T„(t)| < max |T„(ri)| + max Zin, 

te(0,l) 0<i<b„+l l<i<6„+l 


where Zin = ^^PTi-br,<t<ri l^n(^) — ^n{Ti)\. Observing the inequalities 


\ 7- W 

I ^tn\\p 


\r{t)\dt < / \\Tnmpdt = 0{bnln) 


'P J Ti—bn 


and maxi<.<^^^^ we have 

max = 0 {lnb^r^dP)_ 

l<i<b„+l P 


Similarly, we obtain the estimate || maxQ^^^^^_i_JT„(tj)|||p = Op{bn^^^mn), and picking bn = 
rrin/ln proves the assertion. □ 


Proposition 8.2. Suppose An are sets such that —)■ 0 as n ^ oo, and XnI{An) = Op{l). 

Then Xn = Op{l). 

Proof. For any e > 0, let iV be a large constant such that P(An) < e/2 for n > N, and M be a 
large constant such that P(|X„|/(A„) > M/2) < e/2 for n > N. Then 

P(|X,| > M) < P(|X„|J(kl„) > M/2) +P(|X„|/(^) > M/2) 

< P(kl,) + ¥{\Xn\IiAn) > M/2) < e 

for all n > N. □ 
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